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TRANSPORTERS AND ION CHANNELS 
TECHNICAL MELD 

This invaationrelates to nucldc acid and amino acid sequaices of transporters andionchannds 
5 andto11ieuseofthesesequencesinthediagnosis,trea1meDt,andpreveiittonaftt 

nmsde. and immunological disorders, and in tHe assessment of the effects of exogaioas conqHwnds on 
the expression of nucleic add and amino add sequences of transporters and ion channds. 

BACKGROUND OF THE INVENTION 
10 Enkaryotic cells are siOTOunded and subdiAdded into fuiK^ 

hydropliobiclipidbilayermembranesv^ttcharehighlyimpermeabletomo^^ Cells and 

organelles require Ixansport protdns to import and export essential nutrients and metal ions including 
r, mu\ Pi, SOA sugars, and vitamins, as wdl as various metabolic waste products. Transport 
> pr,iteinsdsoplayrdesinantiT)ioticr«sistance,-t^ synaptic neurotransmission. 

15 kidney function, intestinal absorption, tmnor growfli, and oflier diverse cefl functions (Griffith. J. and C. 
c„^,^ /I QQ«^ ThP Tv.n^nrter'Facts' BOok . Academic Press, San Diego CA. pp. 3-29). Transport 
canoccur by apassiveconcentration-dependejitmedianism. or can be linked to an energy source sudi 
as ATP hydrolysis or anion gradient Proteim that function in transport indude carrier prot^,wMch 
bind to a specific sdute andnndergo a conformational diangetiiat translocates the bound sdhite across 
20 themanbrane,anddianndproteins.wMdiformhydrc^hiUcporeslhataUowspedfi^ 
toough tiie meaiibrane down an dectrodiMnical solute gradient 

Carrier protdns which transport a single solute from one side of the membrane to tiie other 
are called uniporters. In contrast, coupled transport's link the transfer of one solute with 
simultaneous or sequential transfer of a second solute, dtiier intiie same diredion(symport) or hxtixe 
25 opposite direction (antiport). For example, intestinal and kidney epitiieliu^ 

symporter systems driven bythesodiumgradientlhatexistsacrosstijeplasmamemhrane. Sodhnn 

moves into tiie cell down its eledrochemical gradient and brings tiie solule hrt» tiJe cell with it The 
sodium gradient tiiat provides the driving force for solute uptake is maintahiedby the ubiquitous 
Na^/TATPase system. Sodium-coupled transporters indudetiieinammaUangJucose transporter 
30 (SGLTl).iodidetransporter(NIS).andmultivitamhitransporter(SMVT). AH teee transporters have 
twelve putative transmembrane segments, extracellular glycosylation sites, and cytoplasmicaHy- 
oriented N- and C-termini. NIS plays a crucial role in the evaluation, diagnosis, and treatment of 
various tiiyroid pathologies because it is the molecular basis for radioiodide tiiyroid-imaging 
tedmiques and for spedfic targeting of radioisotopes to the tiiyroid gland (Levy, O. et al. (1997) Proc. 
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Natl. Acad. Sd. USA 94:5568-5573). SMVT is expressed inthe intestinal mucosa, kidney, and 
placenta, and is implicated inthe transport of the waterlsoluhle vitamins. e.g.. biotin and pantothenate 
(Prasad, P.D. et al. (1998) J. Biol. Chan. 273:7501-7506). 

One of thelargest families of transporters is the major facilitator superfamily (MFS), also 
5 called the unipoiter-symporter-antipoiter family. MFS transporters are single polyp^de canters 
that transport smaU solutes in response to ion gradients. Members of the MFS are found in all classes 
of Uving organisms, and include transporters for sugars, oligosaccharides, phosphates, nitrates, , 
nucleosides, monocarboxylates, and drugs. MFS transporters found in eukaryotes all have a structure 
comprising 12 transmembrane segments Vao, S.S. et al. (1998) Microbiol. Molec. Biol. Rev. 62:1- 
10 34). Thelargest family of MFS transporters is the sugar transports family, which mcludes the seven 
glucose transporters (GLUT1-GLUT7) found in humans that are required for Hie transport of glucose 
and oflier hexose sugars. These glucose transpprt proteins haye unique tissue distributions and 
physiological fimctions, GLUTl provides many cen types with their has 
transports glucose across epilhelial and endothelial barrier tissues; ^ 
15 or efflux from the Uver,, GLUT3 regulates glucose supply to neurons; GLUT4 is responsible for 
insolin-r^ated glucose disposal; and CH-UT5 regulates fructose uptake mto skeletal muscle. 
Defects in glucose transporters are involved in a recently identified neurological syndrome causing 
infentae seizures and developments delay, as well as glycogen storage disease, Fanconi-Bickel 
syndrome, and non-insuUn-dependent diabetes mellitns (MuecMer, M. (1994) Eur. J. Biochem. 
20 219:713-725; Longo, N. and L J. Hsas (1998) Adv. Pediatt. 45:293-313). 

Monocarboxylate anion fransporters are FOttHi-coupledsynqxirterswilli a broad substrate 
spedfidty that includes L-lactate. pyruvate, and the ketone bodies acetate, acetoacetate, and 
beta-hydroxybutyrate. At least seven isoforms have been identified to date. The isoforms are predicted 
to have twdve transmentoane (TM) hdical domains wifli a large irtracefflular loop between TM6 and 
25 TM7. andplayacriticalrdeinmahitainingintracdtalarpHbyremovingtlieprotonsthatare 
stoichiametricany withlactate during glycolysis. The best diaracterized iT-monocarboxylate 
transporter is that of the erythrocyte membrane, whidi transports L-lactate and a wide range of other 
aliphatic monocarboxylates. Other cells possess iT-llnkBd monocaiboxylate transporters with differing 
substrate and inhibitor sdectivities. In particular, cardiac musde and tumor ce31s have fransporters that 
30 differ m their K„ values for certain substtates. including stereosdectivity for L- over D-lactate. and in 
their sensitivity to inhibitors. There are Na^-monocarboxylatecofransporters on the luminal surface of 
intestinal and Mdney q)itheUa, whidiaUow the uptake of lactate, pyruvate, and ketone bodies in these 
tissues, m addition, there are spedfic and selective fransporters for organic cations and organic anions 
in organs includingthekidney. intestine and Uva. Organic anion fransporters are sdective for 
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hydropliobic, charged molecules with dectron-attracting side groups. Organic cation transporters, such 
as the aramonimn transporter, mediate the secretion of a variety of drugs and mdogaious myetabolites, 
and contribute to the maintenance of intercdlular pH (Poole, R.C. and AF. Haleslrap (1993) Am. J . 
Physiol. 264:Cr761-C782; Price, N.T. et al. (1998) Biochem. J. 329:321-328; and Martindle, K. and I. 

5 Haggstrom (1993) J. Biotechnol. 30:339-350). 

ATP-binding cassette (ABC) transporters are members of a superfamfly of membrane proteuis 
that transport substances ranging from small molecules such as ions, sugars, amino adds, peptides, and 
phospholipids, to lipopeptides, large protdns, and complex hydrophobic drugs. ABC transputers 
consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP to supply the 

10 energy required for transport, and two membrane-spanning domains (MSD), each containing six 

putative transmeiribrane segno^. These four modules may be mcoded by a single gene, as is the case 
for the cystic fibrosis transmembrane r^ulator(CFTR), or by separate genes. When encoded by 
separate genes, each gene product contains a single NBD and MSD. These "half-molecules" form 
homo- and heterodimdrs, such as Tapl and Tap2, the endoplasmic reticulum-based major 

15 histocon5)atibillty (MHC) peptide transport syst^ Several gm^c diseases are attributed to defects in 
ABC transporters, such as the fdldwing diseases and their corresponding proteins: cystic fibrosis 
(CFTR, an ion channel), adrenoleukodystrqphy (adrenoleukodystrqphy protdn, ALDP), Zdlweger 
syndrome (peroxisomal menobrane protedn-70, PMP70), and hypmnsulinemic hypoglycemia 
(sulfonylurea recqrtor, SUR). Ovare^qiression of the multidrug resistance (MDR) protein, another 

20 ABC transportea:, in human canoor cells makes the ceais resistant to a variety of cytotoxic drugs used m 
chemotherapy (TagUcht, D. and S. Mlchaelis (1998) Mefih. Enzymol. 292:130-162). 

A numbo: of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, selenium, 
nickel, and chromium are in5)Qrtant as cofactors for a number of razymes. For example, copper is 
involved in hanoglobin synthesis, connective tissue metabolism, and bone deveiopnient, by acting as a 

25 cofactor in oxidoreductases such as sup^xide dismutase, ferroxidase (cmiloplasmin). and lysyl 

oxidase. Copper and other metal ions must be provided in the diet, and are absorbed by transportors in 
the gastrointestinal tract Plasma proteins transport the metal ions to the liver and other target organs, 
whae specific transporters move the ions into cells and cellular organelles as needed. Imbalances in 
metal ion metabolism have been associated with a number of disease states (Danks, D.M. (1986) J. 

30 Med. Genet 23:99-106). 

Transport of fatty acids across the plasma membrane can occur by diffusion, a high capacity, 
low affinity process. Howeva*, under normal physiological conditions a significant fraction of fatty 
add transport appears to occur via a high afBnity, low capacity protdn-mediated transport process. 
Fatty add transport protein (FATP), an integral n^mbrane protdn with four transmembrane segments, 
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is expressed intissues exMbiting high levels of plasma inembrane fatty add fhix. such as masde. heart, 
and adipose. Egression of FATP is upregulated in 3t3-Ll cdls during adipose conversion, and ' 
expression in C0S7 fibroblasts devates nptate of long-diain fafly adds (Hni. T.Y. et al. (1998) J. 

Bid. Chsn. 273:27420-27429). 

5 Mitochondrial carria proteins are transmembrane-spanning protdns whidi transport ions and 

dmgedmetabolitesbetwe^thecytosolandthenutochondrialmatrix. ExaniplesindudetheADP. 
ATP carrier proldn; the 2-oxoglutarate/malate carrier; the phosphate cairiffprotein^^ thepyruvate 
carrier; the dicarboxylate carrier whidi transports mdate. succinate, fum^^^^ 
tricarboxylatecarrierwMditransportsdtiateandm£date;andtheGrave's*^^^^ 

10 protdnrecognizedbylgGinpatiaits wilbadiyeGrave'sdisease. anautoimmu^^ 

l^yroidism. Protdns mthis fenrily consist of threetandemrqjeats of an ap^oximatdy 100 annno 
add domain, eadi of v^dicontains two transmemhraneregiom (Stryer. L. (1995) 
. Reeman and Company. New York NY. p. 551; PROSITC PDOC00189 Mitodiohdrial enagy transfer 
protdns Signature; Online Menddianlnhetitance in Man (OMIM) *275000 Graves Disease). 

15 IWs class of transporters also includes the mitodiondrial uncoupling protdns. which create 

prdonleateaaosstheim,ermitodiondrijamembrane.tlmsuncoupli^ 

ATPsynthesis. THe result is energy diss^ation in the fonn of heat MitodK)ndrial uncoupling protems 
havebeen implicated as modulators of thermoregulation and mdabblic rate, and have been proposed as 
potential targets for drugs against metabolic diseases sudi as obesity (Ricquier. D. et aL (1999) J. M. 
20 Med. 245:637-642). 
Ion Channds 

The dedrical potential of a cdl is generated and maintained by controlling the movement of 
ions across the plasma membrane, itemovanentofionsrequiresiondiamids. whidifonnion- 
seledive pores witMn the membrane. Tliere are two basic types of ion diannds. ion trans^ 

25 gated ion diannds. Ion transporters utilize the energy obtained from ATP hydrolysis to activdy 

transport an ion against the ion' s concentration gradient. Gated ion dmnds allow pasave flow of an 
ion downtheion's eledrochemical gradient under restrided conditions. Togdher. thesetypes of ion 
d^amiels generate, maintain, and utilize an electroch^cal gradient that is used in 1) dedrical impulse 
condudion down the axon of a nerve cdl. 2) transport of molecules into cdls against concentraticm 

30 gradifids. 3) initiation of musde contradion, and 4) endocrine cdl seadioa 

Ton TnmsportMS 

lontransportersgenerateandmaintaintherestingelectricalpotentialofacdL Utllizingthe 
energy derived from ATP hydrolysis, they transport ions against the ion's concentration gradient 
ThesetransmfflnbraneATPasesaredividedmtothreefamiUes. Thephosphorylated(P)dassion 
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transporters, including Na*-K* ATPase, Ca^-ATPase, andH^-ATPase. are acUvatfidby a 
phosphorylalion event P-class ion transporters are responsible ftjr maintaining resting potential 
distributions such Hiatcytosolic concentrations ofNa^andCa^^arelowandcytosolicconcaitratiQnof 
K* is high. The vacuolar (V) class of ion transporters includes H* pumps on intracettular organdies, 
such as lysosomes and Golgi. V-class ion transporters are responsible «3r generating the low pH wttWn 
the lumen of these organdies that is required for function. The coupling factor (F) dass consists of IT 
pumps in the mitochondria. F-class ion transporters utilize a proton gradient to generate ATP from 

AD? and inorganic phosphate (Pj). 

•nieP-ATPases arehexamers of a 100 IcD subunit with ten transmembrane domams and 
seversa large cytoplasmic regions that may play a role in ion bhidhig (Scarborough. G.A. (1999) Cuir. 
OphL Cell Biol. 11:517-522). The V-ATPases are composed of two functional domains: the Vj 
domain, apet^eral conQ)lexresponablefor ATP hydrolysis; and the Vodomahi, an integral, 
complex responsible for proton translocation across the membrane. TheF-ATPases are structurally 
and evolutionamy related to the V-ATPases. Ihe F-ATPase Fo domain contains 12 copies of the c 
subunit. ahighlyhydr(5)h6bicprotdn composed of two transmembrane domains and containing a 
single buried carboxyl group in TM2 that is essential for proton transport The V-ATPase Vo domain 
contains three types of homologous c subunits with four or fivetransmMnbrane domains and the 
essential caiboxyl group in TM4 or TM3 . Both types of cowpl&x also contain a single a subunit that 
may be involved m regulating the pH dq)endence of activity (Forgac, M. (1999) J. Biol. Chem. 
274:12951-12954). 

The resting potential of the cefl is utilized in many processes involvmg carrier prot^ and 
gated ion channels. Cairiar protehis utilize the resthig potential to transport molecules hito and out of 
the cell. Amino add and glucose transport hito many cells is linked to sodium ion co-transport 
(symport) so that the movement of Na* down an dectrochemical gradient drives transport of the other 
molecule up a concentration gradient Shnilarly. cardiac musde links transfer of Ca** out of the cell 
with transport of Na* mto flie cell (antiport). 

flatPrt Tnn rhafinel.S 

Gated ion channels control ion flow by regulating the opemng and closuig of pores. The 
ability to control ion flux through various gating mechanisms allows ion diannds to mediate such 
diverse signaling and homeostatic functions as neuronal and endocrine signjding. muscle contraction, 
fertilization, and regulation of ion and pH balance. Gated ion diannds are categorized according to 
the manner of regulating the gating function. Mechanically-gated channds open thdr pores in 
response to mechanical stress; voltage-gated chamiels (e.g., Na^ K\ Ca^\ and Q" diannds) open 
thdr pores m response to changes in membrane potential; and Ugand-gated channds (e.g., 
acetylcholine-, serotonin-, and glutamate-gated cation channds, and GABA- and glycme-gated 
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ttsKOalioma moaUicatlons, such as phosptoryMon. 
5 M«J«-«aiy.g««lo,n,echa«>s«s>^^^ 

10 274:6330-6335). ^ „ 

T.»p^«>«^sab».i«cf me vonag^gat^lcaflon Channels fonnasup^^ 

<^p«eh.. T^char^^^cdo^lnora^-chaM^lprou^comprtsessix^nsm^^^ 

. • „o,™n.Mocate<ltemeenS3an<lS6,andii«racell»larafflmoai«i 

^ ^chash^ano^^c*. Ac«on^»«..ca-tonau«an«-.r*asea^^^^ 
c^acuo. a,i3e«™m-ge..ansl=-d=angesin««pe™ea«>i^-«»-*^^ 
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25 vo..age.ga».pc«assi™c— . C.^.V»^>--^<-^"^^ 
....UrizaUonofttem^Bbrane. Vo,uge.ga«<*annds»«B- charge resi*«»<^'^ 
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„,*e^an»p«=nm.. ,»cavaUon.n««a,«lh,me<^.sN-.en^,^»=«-'« 

30 matdo^aepcr. T^«a«ttoa.*o-anina^ve»ados«.s.a^re^.r««m«>.es».g 
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subunits conrdates with increased fiinctional ejqiression of the diannd, a change in its gating 
properties, as wdl as an increase in whole ceffl capacitance due to an increase in mfioihranesnrfece area 
asom, Li. et al. (1995) Ceia 83:433-442). 

Non voltage-gated Na* channds include the members of the amiloride-seDSitiveNa* 
5 chamel/degaierin (NaC/DEG) f amfly. Channel subunits of this family are ftought to consist of two 
transmembrane domains flanking a long extracellular loop, with the amino and carboxyl tOTomi located 
within the cdl. The NaC/DEG family includes the epithelial Na* channd (ENaC) involved in Nai" 
reabsorption in q)itheSia including the airway, distal colon, cortical collecting duct of the kidney, 'and 
exocrine duct glands. Mutations in ENaC result in pseudohypoaldosteronism type 1 and Liddle's 
10 syndrome (pseudohyperaldosteronism). The NaC/DEG family also includes the recently characterized 
H*-gated cation diannds or add-soishig ion channds (ASIC). ASIC subunits are eicpressed in the 
brain and form heteromnlthneric Na*-permeable channds. These diannels require acid pH fluctuations 
for activation. ASIC subunits show hamdogy to Hie degenerim, a family of mechanically-gated 
diannds originany isolated from C. d^ans. MutatiCHis in the degaierins cause neurod^aieratioa 
15 ASIC subunits may also have aioleinneunmalfimtian, or in painpercqjtion, since tissue acidosis 
' causes pain (Waldmann, R. and M. Lazdunski (1998) Curr. Ojm NeurobioL 8:418-424; Eglai, R.M. 
dL aL (1999) Trends PharmacoL Sd. 20:337-342). 

channels are located in all ceU types, and may be regulated by voltage, ATP concentration, 
or second messaigMS such as Ca^ and cAMP. In non-exdtable tissue, K* channds are hivolved in 
20 protdn synthesis, control of endocrine secretions, and the maMtMiance of osmotic equililsium across 
membranes. In neurons and otha- exdtable cells, in addition to regulating action potentials and 
rq)olarizing membranes, channds are responsible for setting resting membrane potential. The 
cytosol contains non-diffusible anions and, to balance this net negative diarge, the ceJl contains a Na*- 
K* pump and ion dianneas that provide the redistribution of Na+, K*. and CI'. The pun?) activdy 
25 transpcMtsNa*outoftbecdlandK*intothecdlina3:2ratio. Ion channds in the plasma msiihrane 
allow K* and CI ' to flow by passive diffusion. Because of the high n^alive diarge witWn the cytosol, 
CI ■ flows out of the cdl. The flow of K* is balanced by an dedromotive fiice pullmg K* into the cdl, 
and a K* concMitration gradiait pushmg K"^ out of the cdl. Thus, the restmg membrane potential is 
primarily regulated by K+flow (Salkoff, L. and T. Jegla (1995) Neuron 15:489-492). 
30 Potassium channel subunits of the Shaker -like superfamily all have the characteristic six 

transmembrane/1 pore domam structure. Four subunits combine as homo- or Heterotetramers to form 
functionja K channels. These pore-forming subunits also associate with various cytoplasmic P 
subunits ttiat alter channd inactivation kinetics. Ibe ShakCT -like channel family mcludes the voltage- 
gated K* channels as wdl as the ddayed rectifier type diannds such as the human ether-a-go-go 
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related gene (HERG) associated with long QT, a cardiac dysryflmia syndrome (Curran, M.E. (1998) 
Curr. Opin. Biotechnol. 9:565-572; Kaczarowski. GJ. and Ml. Garda (1999) Cm. Opin. Chem. 
Bicd. 3:448-458). 

A second superfamily of r channels is composed of Ihe inward rectifying channels (Kir). 
5 KirchannelshavethepropertyofpreferentianyconductingrcutrenteintheinwarddirectloiL These 

proteins consist of a single potassium selective pore domain and two transmembrane domains , which 
correspond to the fifth and sixth transmembrane domains of voltage-gated r channels. Kir subnnits 
also assodale as tetram^. The Kir family includes ROMKl. mutations in which lead 1« Bartter 
syndrome, a renal tnhnlar disorder. Kir chamiels are also involved in regulation of cardiac pacemaker 
10 activity, seizures and epilepsy, and insulin regulation (Doupnik, C A. et al. (1995) Curr. Opin. 
NeurobioL 5:268-277; Curran, supra) . 

Therecently recognized TWIKK^ channel family hicludes the mammalian TWIK-1, TREK-1 
and TASK proteuis. Members of this family possess an overall structure with four transmembrane 
domains and two P domahis. These protdns are probably Evolved in controlling the resting potential 
15 inalargesetofcdltypes(Daprat,F.etal.(1997)EMBOJ16:5464-5471). 

The voltage-gated Ca** channds have been classified into several subtypes based upon their 
electrophysiological andpharmaoflogicJdcharactfiristtcs. L^ypeCa^diannds are predominantly 
expressed in heart and sfcdetal rausde where they play an essential role in exdtation-cQntraction 
coupling. T-typechanndsarein5)ortantforcardiacpacemakeractivity.v*ileN-type 
20 chamidsarehivolvedinthecontrolofneurotransmitterrdeasehithecentralandperipha 

systaa The L-type and N-type voltage-gated Ca^channds have been purified and. though the^ 
functions differ dramatically. Ihey have sunilarsubunit compositions. The diamids are composed of 
threesubunits. The subunit forms the menibrane pore and vdtage sensor, while the o^S and p 
subunits modulate the voltage-dependence, gating properties, and the currait ampUtude 
25 Thesesubmiitsareencodedbyatleastsixai,oneai5,andfcurpgenes. A fourth subunit.Y. has been 
identified in skdetal muscle (Walker. D. et al. (1998) J, Bid. Chan. 273:2361-2367; McQesk^. E.W. 
(1994) Curr. Opin. Neurobiol. 4:304-312). 

Chloride channds are necessary in endocrine secretion and in regulation of cytosdic and 
organdlepH. In secretory q>ithelial cdls. CI" enters tiie cell across abasolateral membrane through an 
30 Na*. K^/a- cotransportfir. accumulating in the cell above its electrochemical equilibrium concoitration. 
Secretion of Q ' from the apical surface, in response to hormonal stimulation, leads to flow of Na * and 
watff into flie secretory lumen. The cystic fibrosis transmembrane conductance regulator (CFTR) is a 
dilaride dumnd encoded by the gene for cystic fibrosis, a common fatal genetic disorder in humans. 
CFTR is a merhbff of the ABC transporter femfly. tod is composed of two domains eadi consisting of 
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Six transmembrane domains foUowedby a mcleotid^binding site. Loss of CFTR function decreases 
transepitbelialwata secretion and. as a result, the layers of mucus that coat the respiratory tree. ' 
pancreatic ducts, and intestine are dehydrated and difficult to dear. The resulting blockage of these 
sites leads to pancreatic insufficiency, "meconimn ileus", and devastating "dironic obstructive 
5 pulmonary disease" (M-Awqati. Q. et al. (1992) J. Exp. Biol. 172:245-266). 

The voltagergatedcMoride channels (CLC) are characterized by 10-12 transmembrane 
domains, as well as two small globular domains known as CBS domains. The CLC subunits probably 
function as homotetramers. CLC proteins are involved m regulation of cell volume, membrane ' 
potential stabilization, signal transduction. andtransepitheJial transport. Mutations in CLC-1. 
10 expressedpredominanflyinskdetalnmde.areresponsiblefQrautosomalrecessivegm^ 

myotonia and autosomal dominant myotonia congenita. >vhile mutations in the Mdney diamiel CLC-5 
. lead to kidney stones (Jentsch. T.J. (1996) Curr. pim Neurobid. 6:303-310). 

Ligan*-gated diannds open theif pores when an ertracdlular or inttacdlular mediator binds to 
thedianneL Ncurolransmitter-gated diannds are diannds that open ^ a neurotransmitte 
15 thdrextracdliflardomain. ThesedianndsKdstinthepostsynapticmahhraneof 

Therearetwotypesofneurotransmitter-gateddiannds. Sodium diannds open in response to 
exdtatory neurotransmitters, sudi as acetyldK)line. glutamate, and serotonin. This opening causes an 
influx of Na* and produces the initial localized dqKflarizationthat activates the vdtage-gated diannds 
andstartstheadionpotentiaL Chloride diannds open hi response to inhibitory neurotransmitters, sudi 
20 asY-aminobutyricadd(Gi^A)andglydne,leadingtohype.polari2alionofthemmhr 
subsequent generation of anadionpotentiaL Neurolransmitter-gated ion diamids have fiiur 
transmembranedomainsandprobablyfunrtiOnaspentamers(Jentsdi.§m). Amino adds in the 
second transmembrane domain appear to be fanportant in detennhiing diannd permeation and 
sdectivity (Sather, W.A. et al. (1994) Cuir. Opia NeurdJioL 4:313-323). 
25 Ligand-gated channels can be regulatedby inttacdlular second messengers. For exan^ple. 

calcimn-activated channels are gated by internal caldum ions. In nerve cdls. an toflux of caldum 
during depolarization opens channels to modulate the magnitude of the adion potential GsM d al.. 
supraV Thelargeconductance(BK)channdhasbeenpurifiedfl:ombrainandilssubunitcon5)OSition 

determmed. Hie a subunit of the BK chamid has sevai rather than six transmembrane domains in 
30 contrasttovdtage-gatedK^chamiels. Ibe extra transmembrane domain is located at the subunit N- 
terminus. A28.amino-add strddiintheC-terminal region of the subunit (the "caldumbowl" region) 
contains many negativdydiarged residues and is thought to be the regionresponsible for calcimn 

binding. The p subunit consists of two transmembrane domains connededby a glycosylated 
exlracdlularloqp. with intracdMar N- andC-temrini (KaczorowsM. sjaaa; Vergara, C. dal. (1998) 
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Curr. Opin. Neurobiol. 8:321-329). 

CycUcmcleotide-gated(CNG)cbaimdsaregatedby(q^cfflc(^m^ Thebest 

examples of these are the cAMP-gatedNa* channds invo^^ 

channels involved in vision. Both systems involve ligand-mediated activation of a G-protein coupled 
recq)torwhichthenaltersthelevelof cyclicnucleotidewithinthecell. CNGchannds also represent a 
major pafliway for Ca^* entry into neurons, and play roles in nearonaldevdqpment and plasticity. 
CNG channels are telramers containing at least two types of subunits. an a subunit which can farm 
functional homommc channels, and a P subunit. which modulates the channel properties. All CNG 
subunits have six transmohbrane domains and a pore forming region between the fiflh and sixth 
transmanbranedamains.similartovoltag^gatedK^chamieis. A large C-tenninaldomahi contains a 
cyclic nucleotide bmdmg donuun. while the N-terminal domain confers variation among channd 
subtypes (Zufall, F. et al. (1997) Cuir. Opin. Neurobid. 7:404-412). 

The activity of ottier types of ion channd proteins may also be modulated by a variety of 
mlracdlularsignamngprotdns. Many chamiels have sites for phosphorylation by one or more protdn 
Idnases hicluding protein Idnase A, FOtein Mnase C. tyrosme kinase, and casdn ktoase H. all of 
whichregulateiondiamidactivityincells. Kir diannds are activated by the Wnding of tiieGPY 
subunits of heterolrimeric G-protdns (Rdmann, F. andF.M. Ashcroft (1999) Curr. Opht Cdl. Biol. • 
11:503-508). Otber proteins are involved hi the localization of ion channds to spedfic sites mtiie cell 
membrane. Suchproteins include tiiePDZ domainprotdns known as MAGUKs (membrane-assodated 
guanylate kinases) which regulate the clustering of iondiannds at neuronal synapses (Craven. S.E. and 
D.S. Bredt (1998) Cdl 93:495-498). 
Disease Corrdation 

The etiology of numerous human diseases and disorders can be aOributed to defects hi the 
transport of molecules across m^ranes. Defects in the trafBcking of membrane-bound transporters 
andiondiannds are assodated with several disorders, e.g.. cystic fibrosis, glucose-galactose 
malabsorption syndrome, hyperdiolesteroleima, von Gierke disease, and certain forms of diabetes 
mdlitus. Smgle-gene defect diseases resulting in an mability to transport small molecules across 
mmhranes mclude, e.g., cystinuiia, imhioglydnuria. Hartup disease, and Fanconi disease (vant Hoff, 
W.G. (1996) Bxp. Nq>hrol. 4:253-262; Taleate. G.M. et aL (1994) Ami. Intem. Med. 120:218-226; 
andChinon, M. d al. (1995) NewEngL J. Med. 332:1475-1480). 

Hmnan diseases caused by mutations in ion diannd genes include disorders of skdetal muscle. 
cardiacmusde.andthecentralnervous system. Mutations in the pore^formhig subunits of sodimn and 
ddoridediamids cause myotonia, a musde disorder in whidirdaxation after voluntary contradion is 
delayed. Sodium diannd myotonias have been treated with diannd blockers. Mutations ui muscle 
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sodium and caldTmdiaiu^ls cause forms Of periofflc paralysis. wMe^^^ 
calciumrdeasechamd.T-tubulecalcimnchaBnd, and musde sodium diannd cause malignaiit > 
hyperthermia. Cardiac arrythmia disorders such as thelong QT syndromes andidiqpatluc ventticular 
fibriUationarecausedbymutationsinpotassiumandsodium channels (Cooper,E.C. andL.Y. Jan 
5 (1998) Proc. Natl. Acad. Sd. USA 96:4759-4766). AD four known human idiopalMcepilq)sy genes 
codefor ioadiannd protdns (Berkovic S.F. and I.E. Scheffer (1999) Cmx. Opin. Neurology 12:177- 
182). Other neurological disorders sudi as ataxias, hemiplegic migraine and hereditary deafees^ can 
also result frommutations in ion dianndg«fis (Jen. J. (1999) Curr. Opin. Neurbbid. 9:274-280; 
Cooper, supra) . 

10 loudjanndshavebeenthetargetformanydrugthfir^ies. Neurotransmitter-gated chaniiels 

havebeaitargetedmtherapies for treatment of insomnia, anxiety, depression, and sduzqphrenia. 
V(dtage-gated dianneis have been targeted hi therapies for arrhythmia, ischemic stroke, headtramna. 
andneurodegenerativediseaseCraylor, CP. andL.S. Narashnhan (1997) Adv. Pharmacol. 39:47-98). 
Various dasses of ion diannds also play anmDortantrdeinthepercqrtionof pain, and thus are 

15 potentialtargetsfornewanalgesics.ThesehK:hide1hevaninold.gatediond^ 

bythevanilloidcapsaidn,aswdlasbynoxiousheat I^al anesthetics sudi as Udocahie and 
mexiletine which blod^vcfltag^gatedNa^ diannds have been useM in the treatment of neuropathic 

pain (Eglai, supra) . 

Ion diannds to the hnnrune system have recKJtly been suggested as targets for 
20 immunomodulaHoa T-cdl activation dq^ndsupcmcalcimnsignaltog, and a diverse set of T-ceil 

specificionchanndshasbeendwaderizedtimtaffedthissignalmgprocess. Channd bloddng agents 
caninMbitsecretionoflynq,hokmes.cdlproMeration.andkfllingoftargetc^^ A peptide antagonist 
of theT-cell potassimn channel kvl.3 was found to suppress ddayed-typehypersensilivity and 
allog^c responses m pigs . validating the idea of diannd blodsers as safe and efficadous 
25 umnunosuppressants (Calahan, M.D. and K.G. Chandy (1997) Curt. Opto. BiotedmoL 8:749-756). 

The discovery of new transporters and ion channds andthepdynudeotides encodtag than 
satisfies a need m the art by providing new compositions which are useful to the diagnosis, prevention, 
and treatment of transport, neurological, muscle, and hmnunological disorders, and tothe assessm^ of 
the dfeds of exogenous compounds on the expression of nuddc add and ammo add sequences of 
30 transporters and ioa diannels. 

SUMMARY OF THE INVENTION 
The tovention features purified polypq>tides, transport's and ion diamids, referred to 
collectivdy as "TOICH" andindlvddually as 'TRICH-l," "TOICH-2," "TOICH-3," "TOICH-4." 
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"TRICH-5." "TRICH-6," "TRICH-7," "TRICH-8," 'TRICH-9." 'TRICH-IO » 'TRICH-11," 
'TRICH-12," 'TRICH-13," "TRICH-14." 'TRICH-IS," «TRICH-16," 'TRICH-17." 'TRICH-IS," 
'TRICH-19." 'TRICH-20," "TRICH-21 " 'TRICH-22." •TRICH-23." 'TRICH-24." "TOICH-25." 
"TRICH-26," and 'TRICH-27." In one aspect, the invention provides an isolated polypq)tide 
5 comprising an amino add sequence selected from the group consisting of a) an amino acid sequence 
sdected from tbe group consisting of SEQ ID NO: 1 -27. b) a naturally occurring amino acid sequ«ice 
■ having at least 90% sequence identity to an amino acid sequence selected from the group consisting of 
SEQ ID NO:l-27, c) a biologically active fragmait of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-27, and d) aninmamogenic fragmrait of an amino acid sequence sdected 
10 fromthegroupc(HisistingofSEQIDNO:l-27. In one saternative, the hwention provides an isolated 
polypqrtide con^sing the amino add sequoice of SEQ ID NO:l-27. 

The invention further provides an isolated polynudeottde encodhjg a polypeptide comprising an 
ammo add sequence sdected from the group consisting of a) an amino add sequence sdected from the 
group consisting of SEQ ID NO:l-27. b) a naturally occurring amhio add sequence having at least 
15 90% seqaraice identity to an amino add sequence sdected from the group consisting of SEQ ID NO:l- 
27, c) a biologically active fragment of an ammo add sequence sdected from tiie group consisting of 
SEQ ID NO:l-27, and d) an hnmunogenic fragment of an anrino add sequence sdected from the group 
consisting of SEQ ID NO:l-27. In one alternative, tiie polynucleotide encodes a polypeptide sdected 
fromtiie group consisting of SEQ ID NO:l-27. In anoflier alternative, tiiepdynudeotideis sdected 
20 from tiie group consisting of SEQ ID NO:28-54. 

Additionally, tiie invention provides a recombinant polynudeotide comprismg a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide comprising an amhio add 
sequence sdected from tiie group consisting of a) an amino add sequence sdected from tiie group 
consisting of SEQ ID NO:l-27, b) a natiirally occurring amino acid sequence having at least 90% 
25 sequence identity to an amino acid sequence sdected from tiie group consisting of SEQ ID NO:l-27, c) 
a biologically active fragment of an amino add sequence selected from tiie group consisting of SEQ ID 
NO:l-27, and d) an mramnogenic fragment of an amino acid sequence selected from ttie group 
consisting of SEQ ID NO:l-27. In one alternative, tiie invention provides a cell transformed witii tiie 
recambhuratpolynndeottde. In anotiier alternative, tiie invention provides a fransgenic organism 
30 comprisuig tiie reconibinant polynudeotide. 

The hivention also provides a mefliod for produdng a polypeptide comprising an anuno add 
sequence sdected from tiie grot?) consisting of a) an amino add sequence sdected from tiie group 
consisting of SEQ ID NO:l-27, b) a natiu-aHy occurring amnio add sequencehavmg at least 90% 
sequence identity to an amino add sequence sdected frran tiie group consisting of SEQ ID NO:l-27, c) 
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a biologically active fragmjent of an amino add sequence selected from the group consisting of SEQ ID 
NO:l-27, and (© an iimnunogeaoic fragmait of an amino add sequence sdected from the group ' 
consisting of SEQ ID NO: 1-27, The method comprises a) culturing a cdl under conditions suitable for 
ejcpression of tiie polypqptide> wherdn said ceil is transformed with a recombinant polynucleotide 
5 comprising a promote sequaice operably linked to a polynucleotide aicoding the polypeptide^ and b) 
recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a I 
polypq)tide comprising an amino acid sequence selected from the group consisting of a) an amino acid 
sequence selected from the group consisting of SEQ ED NO:l-27, b) a naturally occurring amino acid 

10 sequence having at least 90% sequence identity to an amino acid sequ^ce sdected from the group 
consisting of SEQ ID NO: 1-27, c) a biologically active fragment of an amino add sequence selected 
from the group consisting of SEQ ID NO:l-27, and d) an immunogenic fragmmt of an amino add 
sequence sdected from tlie group consisting of SEQ ID NO: 1-27. 

The invention fuitha: provides an isolated polynudeotide con5)rising a polynudeotide sequence 

15 sdected from the group consisting of a) a polynucleotide sequmce sdected from the group consisting of 
SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 90% sequence 
identity to a polynudeotide sequence selected from the group consisting of SEQ ID NO:28-54, c) a 
polynucleotide sequmce complooientary to a), d) a polynucleotide sequence complanentary to b), and e) 
an RNA equivalmt of a)-d). lu one alternative, the polynudeotide con5)rises at least 60 contiguous 

20 nucleotides. 

Additionally, tioe invention provides a method for detecting a target polynudeotide in a sanq)le, 
said target polynucleotide having a sequence of a polynucleotide conpdising a polynucleotide sequence 
sdected from the group consisting of a) a polynucleotide sequence sdected from the group consisting of 
SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 90% sequeice 

25 ideaSity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, c) a 

polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
an RNA equivalent of a)-d). The method comprises a) hybridizing the sample with a probe coniprising 
at least 20 contiguous nucleotides comprising a sequence complementary to said target polynucleotide 
in the sample, and which probe specifically hybridizes to said target polynucleotide, under conditions 

30 whereby a hybridization complex is formed between said probe and said target polynucleotide or 
fragments thereof, and b) detecting the presmce or absence of said hybridization complex, and 
optionally, if presmt, the amount thereof. In one altOTiative, the probe con:5)rises at least 60 contiguous 
nucleotides. 

The invaition furflw provides a method for detecting a target polynucleotide in a san?)le, said 
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SEQIDNa28.54,Wanaturanyocc«dBgpclymcleottdesequeKel«n^ 
i*,*,y«,apc.,n«leoadeseq««»sdecKdftomtegcoupc^ ■ 

ab«^<fs<Maa^«g=.pol,m*o«a»<«fn>gm««t^^ »A <5«ionaU,. itp«se«,to 
amount fhereof. , 

<.„^ an artno ada sevence ftcm 

«iecWS™«»8™^ccmiad,«c*SEQmNai.27,».»Uman,occu^ 

M,*«a.le.at9(»seqae««i*«l.y».aa»>i»«*>s.q»^ 
SBQmN01-27.o)abiol<*<^y«=ti«Sagm<tfafanantoacMseve««^»™° 
,3 <.««ngotSEQIDNO:^27,aoid)anln™«.gc*l^ 

SEQIDNOl-27. mlm-«*oaaaffltlonaBypiovkte.n«fl«l<rftreatogadl«as.oro^ 

assodaudwimtoxeasMcxpre^ofta.*^ 

20 need of such treatment the composition. 

TT«iavenUoaalsopr<md«ametW«»««e^acon>p<»^ 

agonist Ofapolypepuaecorvisingan^.dds.^se.^so-'I^S"-^'^ 
aM.oacMs«p»cesdec«dtomme^co-i^-SEQmNai-Z7.«a^oc^^ 

a,^ acid sequm. wag a. least 90% se.,^ U-% to "» «*• 

25 group oonsisti,^ Of SEQ ID NO:l-27, c) a bioiogican, acttve6agm« of an aoid«<pe«e 
s^fro«,h.^c««is.ingotSEQIDNO:1.27.a»l«.n,™«gedctoga««ofan^ 

acidseq«=a«ad««dftomtii.gro»pcomisUngotSEQIDNO:l-27.Th=»»«^ 
»:^as.fflpleoon,prisix«t.>epolyp.p.idet„acompo»»l,andb)*««^ 
«^ ta„^d«„^tl.invendo„provid=sacon^slttoacomprislnga..go«c^^ 
30 ,«iBrfl>yto«.hodai.l.pbam«=e„Uca]l,accepUibleexcipiaotIn^ 
toeaiaaprovi6«an»=«KxJoflreaangadi^o,co»Ji.iona^^ 
«^„„alTMCH,can^slng.amiBis.«i»g.oapa,ienti«.^otsuch,r«^^ 

AMttonaay. tie ia™*" pnMte a meOwd for screws a compo«.d for etfective.esa as 
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Of a) anamino addsequoicesdectedfronitbegroiip consisting of SEQ ID NO:l-27.b) a naturally 
occuiring amino add sequence having atleast 90% sequeoceidentity to anamino add sequence ' 
sdected from ttie group consisting of SEQ ID NO:l-27. c) a bidogicany adive fragmeat of an amino 
add sequence seleded from the group consisting of SEQ ID NO:l-27. and d) an hrannnogedc fragment 

5 of an amino add sequence sdected from the group consisting of SEQ ID NO:l-27. Hie metiiod 
comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
antagonist adivity in the sample. In one alternative, the invention provides a composition comprising 
an antagonistcompoundidentifiedbylhe method andaphaimaceuticaUy acceptable exdpienti In 
another alternative, the invention provides a method of treating a disease or condition associated with 

10 overexpression of ftmctional TRICH, conqjrising administermg to apatient in need of sudi treatment 
theconq)OSition. 

nie invention further provides a method of screening for a compound that specifically binds 
to a polypeptide comprishig an amino add sequence sdected from the group consisting of a) an amino 
add sequence sdeded fromfhe group consisting of SEQ ID NO:l-27, b) a naturally occurring amino 

15 add sequence having atleast 90% sequence identity to an amino add sequraice seleded from the group 
consisting of SEQ ID NO:l-27, c) a Wdogicany active fragment of an amino add sequence seleded 
from the group consisting of SEQ ID NO:l-27, and d) an immunogenic fragment of an amino add 
sequence sdeded from the group consisting of SEQ ID NO:1.27. The mdhod comprises a) combining 
the polypeptide with at least one test compound under suitable conditions, and b) ddecting binding 

20 of the polypeptide to the test conqwund. thereby identifying a compound that spedfically binds to the 
polypeptide. 

The invention further provides a method of screemng for a compound that modulates the 
activity of aptflypeptide comprising anamino add sequence sdeded from the group consisting of a) an 

amino add sequence sdeded from the group consisting of SEQ ID NO: 1-27. b) a naturally occurring 
25 ammo add sequence having at least 90% sequence identity to an amino acid sequence sdeded from the 
group consisting of SEQ ID NO:l-27. c) abiblogically active fragment of an annno add sequence 
sdeded fromtlie group consisting of SEQ ID NO:l-27. and d) an immunogenic fragment of an amino 
add sequence sdeded fromthe group consisting of SEQ ID NO:l-27. The method comprises a) 
combinmg the polypeptide wifli at least one test compound under conditions permissive for tiie 
30 activity of the polypeptide, b) assessuig the adivity oftte polypeptide in the presence of the test 

compound, and c) comparing the adivity of the polypeptide in the presence of the test con^KJund with 
the adivity of the polypeptide in the absence of the test compound, wherdn a diange in the activity of 
tiie polypeptide in the presence of the test compound is mdicative of a compound that modulates the 
activity of tlie polypeptide. 
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atetag«p«ssiono,a«rge.pol,m.*ottde,wi«einsaH«rg«pol,™^ 

5 eamKSSloiioftlietiitgetpolymideoliiSe. 

Ttein««tt<«fu«to[«>vlteaB»mod tor a^g went, ofate^compount^^ 

b)h*ndizbgmen»ddcacidsrftot«.KdW<aogical sample wimaprote 00^^^ 

,0 g^ap oomiatag of 1) a v^ymc^ ^ ^ ^ "^"^ 7 « 

oco^rl^polym^-^^^-ccMving atlea«90% se,^^ 

apd,™*oMese,»««»«^'«e^6omtt»gBn,poonsisU^ 

p„,i™,d««de seque^e oon*l<^ to i). iv) a poly^cleoUde se<pe.ce coMpleme^ » u), 
a„a,)anRNA«|Ul.al«torW H,W<llzatt<moccmsmde,conaittom whereby a specific 
15 l^S<«co>ap.exisft™ed,««».«>a.idpro.«»«lat.ge.poly.Md^^^ 
^ saia»ge.pol,>nK.e«U.e<».n^ apc.^*c««e»e^ 
e„„^ «i) ap61j™.cleolide8eq»e,« seleaca ftomtt^grocp comisdBg of SEQ ID N0.28-M, 
m.n«ManyoccMrtngp01yr»d«><iaesequencetovtogatleaa90%se^^ 
pd,„<le«i*se^sdecW6omfl»gra»pconsW»gafSEQID 

^ p„l,m.deoU*»^c.eo,.*.«^»0.1v)apoly™<^«q"=-« 

ana V) RNA equivatem of 1)-W). Mternattvdy. tbe «rge. poly^deodde corBpn- a m 
apol,i.udeattdese,»e»cesdecMftoma.eg«>^comMBg ibove; c) <pan«lyingfl.e 

a,^rfi,Mdiza«c«corapl«anad)c<»..pa.W 

s^ple,«be«lnaaifi^in«»«no.n.ofhyWdiz«ion<»m0exin«.efle^ 
is indicative of toxidly of IJietest compound. 
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BRffiF DESCRIPTION OF THE TABLES 

Tablel suinn^izeslhenaamaatareforlhefuniengthpolynud^^ 

30 sequaices of the present invoitian. ^ „ ^ 

Table 2 stows the Ge.,Ba*i*»imcaaoB»d« aBd anKWonotthenear^ 

tanaogfcrea^paiypeptideoftheinveBm mprob«.yscoret<>rthe,Mtd.b*«e.e«t> 

MlvoepttdeaiKi its CeoBanthomcilog is also shown. 

TaUe3sbowss,r.c«»alfea«resofeachpolypeptidese<pe««,i»audingp.^ 

33 aopair,s.aion8Witha>e,BeaKxis,al8ori.hn,s,aBdsearchaI^eda.*^ 
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polypeptide. 

Table 4 lists the cDNA and genomic DNA jfragments wMdi were used to assmible each ' 
polynucleotide sequence* along with selected jfragments of the polynucleotide sequences. 

Table 5 shows the representative cDNA library for each polynucleotide of the inventioa 
5 Table 6 provides an appmdix which describes the tissues and vectors used for construction erf 

the cDNA libraries shown in Table 5 . 

Table 7 shows the tools, programs, and algorithms used to analyze tiie polynucleotides and 
polypeptides of the invention, along with applicable descriptions, ref^ences, and threshold paraineters. 

10 DESCRIPTION OF THE INVENTION 

Before the pres^ proteins, nucleotide sequences, and methods are described, it is unda:stood 
that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that tiie terminology used herdn is for tiie purpose of describing 
particular embodimeaits only, and is not intended to limit flie scope of tiie preset inv^on which will 
15 be Bmited only by the appended claims. 

It must be noted that as used herein and in flie appended claims, the singular forms "a," "an," 
and **flie" taclude plural reference unless the context clearly dictates otiiearwise. Thus, for exan5)le, a 
ref eraice to "a host cdl" includes a plurality of such host cells, and a referaice to "an antibody" is a 
ref er^ice to one or more antibodies and equivalents thareof known to those skilled in flie. art, and so 

20 fOTflL 

Unless defiboed otherwise, all technical and scientific teams used herdn have the same meanmgs 
as commonly understood by one of ordinary skill in flie art to which fliis invartion belongs. Although 
any maddnes, materials, and m^ods simflar or equivalent to those described herdn can be used to 
practice or test the pres^ invention, the prdOTed madiines, materials and methods are now described, 
25 All publications mentioned herein are dted fa: the purpose of describing and disclosing the ceil lines, 
protocols, reagents and vectors which are reported in the publications and whidi might be used in 
connection with the invention Nothing herein is to be construed as an admission that the mvration is 
not entitied to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

30 "TRICH" refers to the amino add sequences of substantially purified TRICH obtained from 

any spedes, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The team "agonist" refers to a molecule which intaisifies or mimics the biological activity of 
TRICH. Agonists may indudeprotedns, nucldc adds, carbohydrates, small molecules, or any other 
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con?>ound or composition wMch modulates the activity of TRICH either by directly interacting ^ 
TRICH or by acting on components of ibe biological patiiway in x^dx TRICH parti(^ates. 

An-allelicvarianfisanaltemativeformoftbegeneencodingTRICH. Allelic variants may 
result from at least one mutation inthenucldc acid secpenceandmayrestit in all^mRN^ 
5 pcflypqjtideswhosestnictureorfunctionmayormaynotbealtered Agenemayhavenone.one,^ 
xnanyandicvariantsofitsnaturallyoccurringform. Common mutational changes v^di give rise to 
alldic variants are generally ascribed to natural deletions, additions, or substitutions of nudeotides. 
Eadiof tbesetypes of dumges may occur alone, or in combination witiilhe others, one or more times m 
a ^ven sequotice. 

10 . "Altered" nuddc add sequences encoding TRICH include those sequences mth ddetions. 

insations. or substimtions of different nudeotides. resulting in a polypeptide the same as TRICH or a 
polypeptide^atleastonefimdionald^aderisticof TRICH. Included within this definition are 
polymorphisms v^d. may or may not be readily detedable using a particular oU^^^^^ 
thepdlynudeotideencodingTWCH.andimprop«oruneq,ededhybridizati^ 
15 alocusoflierlhanlhenormaldiromosomallocusforlhepolynudeottdesequenceencod^ 

Theeucodedprotdnmay also be "altered." and may contain ddetions. insertions, or substitutions of 
amino add residues which produce a silent diange and result in a functionally equivalait TRICH. 
Ddiberate andno add substitutions may be made on the basis of simflarity in polarity, diarge. 
solnbility.hydrc^,hobidty.hydr<^dty.and/ortheampWpatMcnam^ 
20 biologicalorimmmidoglcaladlvityofTRICHisretained.Forexan^le.negativelyd^^^^ 

acids may include aspartic add and glutamic add. andpositively diarged amino adds may indude 
lysine and arginine. Amino adds v^nndiarged polar side diains having similar hydiophiliaty 
values mayinclude:aq,aragineand^utamine;andserine and tiireonine. Amino adds witH- 
undiarged side chains having similar hydrophilicity values may indude: leucine, isoleudne. and 
25 valine; glycine and alanine; andphenyManine and tyrosine. 

The terms "anrino add" and "amino add sequence" refer to an oligopq?tide. pqrtide. 
pol>peptide. or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
nK,lecules. mere "amino add sequence" is redtedto refer toasequenceofanaturallyo^^ 
proteinmolecule. "amino acid sequence" andliket«ms are not meant to limit the amino add sequence 
30 to the complete native amino add sequence assodated with the recited protdn molecule. 

"Amplification" relates to tiieproduction of additional copies of a nuddc add sequence. 
AnpMcationis generally carried out using polymerase dmnreadion(PCR)te^^^ 

intbeart. 

Hieterm-antagonist" refers to a molecule whidi inhibits or attenuates ti^ biological activity of 
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TRICH. Antagonists may include protdns such as antibodies, nucleic adds, carbdiydrates, smaU 
molecules , or any oilier compound or oamposition wWch modulates the activity of TRICH d&er by 
direcfly interacting witHTRICH or by acting on components of thebiologicalpafliway in whidiTRICH 

participates. 

5 The term "antibody" refers to intact imnmnoglobulin molecules as wdl as to fragments thereof, 

such as Fab. F(ab')., and Fv fragments, which are enable of binding an q)itopic determinant 
Antibodies that bind TRICH polypeptides can be prepared usmg intact polypeptides or using fragments 
ccHitaining small peptides of interest as the immunizing antigen. Hie polypeptide or oUgopq)tidei used 
toimmunize an animal (e.g., amouse. arat, or arabbit) can be derived from the lianslation of RNA, or 

10 synthesizedchemicany.andcanbeconjugatedtoacamerprotemifdesired. Commonly used carriers 
that are chemically coupled to peptides hiclude bovine serum albumin, tiiyroglobulin. and keyhole 
IhnpethemocyantaCKLH). The coupled peptide is then used to immunize the anhnal. 

The term "andgeoicdetenntoanf'refiers to tiiat region of amolecule(i.e., an epitope) tiiat 
makes contact withaparticular antibody. Whenapioteuiorafragmentof aprotdnisusedto 

15 hmnunizeahostanimal,numerousregiQnsoftheproteinmayinducetijeproductionofantib 

bmd spedfically to antigenic determinants (particular regions or three^mensional structares on the 
protdn). Anantigenlcdeterminantmay compete withtiieintactantigai(i.^,ti)ehnnmnD 

didt tiie imnrane respraise) fiir bhidhig to an antibo(ty. 

The term "antisense" rrfers to any oan5)OsitiQn enable of base-pairing withihe "sense" 

20 (codfaig)strandofaspedficnncMcaddsequence. AntisaisecompositionsmayhKludeDNA;^^^^ 
pq)tidenucldc add (PNA); digomideotides having modified badcbone linkages sudias 
phosphoroflnoates, mefliylphosphonatBS, or benzylphosphonates; oligomideotides having modified 
sugar groups sudi as 2'-meflmydhyl sugars or 2'-meflioxyethoxy sugars; or oUgonudeotides havhig 
modifiedbases sudias 5-methylcytosine.2'Kleoxyuradl.or7-deaza-2'-deoxygua^ Antisense 

25 moleculesmaybeproducedbyanymethodhKludingdiHnicalsynttiesisortranscr^ 

introduced into a cell, the complementary antisense molecule bas^pairs with a naturally occurring 
nucleic acid sequmce produced by the cdl to form duplexes whidi block dtha transcription or 
translatioa The designation "negative" or "minus" can refer to tiie antisense strand, and the 
designation "positive" or "plus" can refer to the saise strand of a reference DNAmoIecule. 

30 The term "biologically active" refers to a protdn having strudnral, regulatory, or biodJemical 

functions of a naturally occurring molecule. Likewise, "immunologically adive" or "immunogenic" 
refers to tiie capability of the natural, recombmant, or syntiidic TRICH. or of any oUgopq.tide thereof, 
to induce a spedfic mrnmne response hi appropriate animals or cdls and tobmd with spedfic 
antihodes. 
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«Con?)lementary" describes tberdationship between two stagLe^stranded nuddc add 
sequences that ameal by bas^pairiug. For e^mp^ 5'-AGT-3' pairs wHb its complanfint. 
3-TCA-5'. 

A "composition comprising a given polynucleotide sequeiicer and a "compositiQn ccanprising a 
5 given amino add sequence" refer broadly to any composition containing tbe givm polynndeoade or 
amiiK, add sequence. The composition may comprise a dry formnlationor an aqueous solution. 
Compositions con^rising polynucleotide sequences encoding TRICH or fragmente of TRICH may be 
employed as hybridization probes. Theprobes may be stored iiifireeze-dried form and may be 
assodatedwithastabilizingagentsuchasacarbohydrate. m hybridizations, the probe may be 
10 deployed in an aqueous solution containing salts (e.g.. NaCl). detergents (e.g.. sodium dodecyl sulfate; 
SDS). and other components (e.g.. Denhardt's solution, dry milk, salmon sperm DNA etc.X 

"Consensus sequence" refers to a nucleic acid sequence whidi has been subjected to rq)ealM 
DNAsequenceanalysis toresdlveuncanedbases. e^ctended using theXL-PCR Idt (PE Biosyst«ns. 
Foster City OA) in the 5- and/or the 3- direction, and resequenced. or whidi has been assembled from 
15 oneormoreoverlappingcDNA.EST.orgenomicDNAfragmentsusingacomputerprogramfor 

fragm^it assembly, sudi as tHe GELVIEW fragment ass^nbly system (GCG. Madison WI) or Phrap 
(University ofWashington. Seattle WA). Some sequences have been hotheaded and ass«nbled to 

produce tbe consensus sequence. 

"Conservative andno add substitutions" are those substitutions that are predicts to least 
20 interfere^ththeprc^erties<rftheoriginjaprotdn.i.e..thestructureande^^ 

protdnisconservedaMnotsignificaMydiangedbysudisubstitutions. The table bdow shows ammo 
adds whidi may be substituted for an original amino add hi a protdn aud wMdi are r^arded as 
conservative amino add substitutions. 
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Original Residue Consexvative Substitution 



Arg H^'Lys 
Asn Asp.Ghi,His 



Asp 
Cys 



Asn, Glu 
Ala, Sa: 



30 



Asn, Glu, His 

du Asp, Gin. His 



Gly 



Ala 
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jji^ Asn, Arg, Gin, Glu 

jjg Leu,Val 

Leu ^^^^ 

Lys Arg. Gto, Glu 

Met Leu, lie 

pijg His,Met,Leu,Trp,Tyr 

Ser Cys.Thr 
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Thr Ser, Val 
Tip Pbe, Tyr 
Tyr His, Phe, Trp 
Val lie, Leu, Thr 



Conservative amino add substitutions generally maintain (a) the structure of tliepolypq)tide 
backbone in the area of the substitution, for exaii5)le, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobidty of the molecule at the site of the substitution, and/or (c) the bulk of the 
side diain. ' 

10 A "deletion" refers to a change in the amino add or nudeotide sequeaice that results in the 

absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. Chemical 
modifications of a polynucleotide can include, for example, replacement of hydrogen by an alkyl, acyl, 
hydroxyl, or amino group. A derivative polynucleotide ^codes a polypeptide which retains at least one 

15 biological or immunological function of the natural molecule. A derivative polypeptide is one modified 
by glycosylation, pegylation, or any similar process that retains at least one biological or immunological 
function of the polypeptide fi-om which it was derived. 

A "detectable label" refers to a report©: molecule or enzyme that is capable of gen^ating a 
measurable signal and is covalaifly or noncovalently joined to a polynucleotide or polypeptide. 

20 A "fi-agment" is a unique portion of TRICH or the polynudeotide encoding TRICH which is 

identical in sequence to but short& in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino add residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nudeotides or amino add residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 

25 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length, R-agments may be preferentially selected from catain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous ammo adds selected 
from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certam 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 

30 specification, includhig tiie Sequence Listing, tables, and figures, may be encompassed by tiie present 
embodiments. 

A fragment of SEQ ID NO:28-54 comprises a region of unique polynucleotide sequence tiiat 
specifically identifies SEQ ID NO:28-54, for example, as distmct from any other sequence m tiie 
genome from which the fragment was obtained, A fragment of SEQ ID NO:28-54 is usefifl, for 
35 example, in hybridization and amplification technologies and m analogous methods that distinguish 
SEQ ID NO:28-54 from related polynucleotide sequences. The predse lengtti of a fragment of SEQ 
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IDNO-28-54and1heregionofSEQIDNO:28-54towMdi1teftagm«itcorres^^ 

detemiinableby one of ordinary skminlheartbasedontheinteadedp^ 

AfragmentofSEQIDNO:l-27isencodedbyaflragmentofSEQIDNO:28-54.Afragment 

of SEQ ID NO - 1-27 comprises a region of unique amino acid sequence that spedfically identifies 
SEQ©NO:l-27.Forexample,afragmentofSEQIDNO:l-27isuseMasanimiimogenicp^^ 

for the development of antibodies that specificaHy recognize SEQ ID NO:l-27. The precise length of 
afragmenlofSEQIDNO:l-27andtheregionofSEQIDNO:l-27towhichthefragment 

cc^respondsareroutinelydetenninableby one of ordinary sldilintheartbas^^ 

purpose for the fragment. 

A'^failength-polynndeotidesequeaceis one containing at leastatransktim 

(e.g.. meOuonme) fifflowedby anopenreading frame and a translationter^^^ 
length" polynucleotide sequence encodes a "full length" polypeptide sequence. 

"Homology" refers to sequence similarity or. interchangeably, sequence identity, between two 
or more polynucleotide sequences or two or more polypeptide sequences. 

Thetems '•perceat identity" and«% identfly." as applied to polynucleotide sequences, refer to 
Ihepercentage of residue matches betweea at least two polynudeottde sequences aligned using a 
standardized algorithm. Suc^ an algorithm may insm. in a standardized and reproducible way. gaps m 
. ihesequencesbdngcomparedinordertoqjtimizejdigmnentbetweentwos^ 

achieve a moremeanmgful con5)arisonof thetwo sequences. 

Percent identity betweenpolynncleotide sequences may be determuiedusto^ 

parameters of IheCLUSTALValgoritiunasincorporatedhitotheMEGALIGNv^^ 
aligmn^FOgtam. This program is part of the LASERGENE soflwarepackage. a suite of molecular 
biological analysisprograms (DNASTAR. Madison WI). CLUSTAL Vis described m Higgins. D.G. 
andP.M. Sharp (1989) CABIOS 5:151-153 andinffiggh^. D.G. et al. (1992) CABIOS 8:189-191. 
For pairwise ahgmnents of pcflynudeotide sequences, the default parameters are set as fono^^^ 
Kmple=2. gap penalty=5. wmdow=4. and "diagonals saved"=4. m "weighted" residue weight tables 
selected as the default Percerrt identity is it^^rted by CLUSTALV as the "percent simiW^ 

aligned polynucleotide sequ^ces. 

Alternatively, a suite of commoiily used and fredy available sequence comparted 

, providedby theNational Center for Biotechnology Information (NCBD Basic I^alAligmnentSeardi 

Tool (BLAST) (Altsdiul. S.F. et al. (1990) J. Mol. Biol. 215:403-410). whid. is available from several 

sources, mduding the NCBI, Bethesda, MD. and on the Internet at 

ht(pV/www.ncbi.nhn.nih.gov/BLAST/. The BLAST software suite includes various s^^ 
programs mduding "blastn." that is used to align a known polynucleotide sequ«K» with other 
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pC^cleoddesequencesfroxnavarietyofdatabases. ^avail^leis atdolc^"BLAST2 ^ 
Se^es"matisusedfor<^^pair^ecomparisonoftwo».dec.de^^^^ 
Secpences-canbcaccessedandusedMeracUvdyathttpy/w^^ ^ 
"BLAST 2 sequences" tool canbeusedfor bothblastn andblasip (discussed below). BLAST 
5 prograB.areco^Blyusedv.ti.gapando^aparan.etassettodefaultse^^^ 
Uaretwo^c^ec^dcsecpe^ces. onexnayuseblasmwim^e"^^^^^ 
2.0.12(ApriI-21.2000)setatdefa^paramet^. Sudrdefa^tparainetersmaybe.forexan?.le: - 

Matrix BLOSVM62 
Reward for mooch: 1 
10 Penalty for mismaich: -2 

Open Gap: 5 and Extension Gap: 2 penalties 
Gc^x drop-off: 50 
Ejq>ect: 10 
Word Size: 11 

15 Filter: on , „^ 

,e«t30. atleas.40, ,,^10. a.teas,I0O.o.alm.20O««WK».».^. S-^. 

percentage idMitity may be measured. 

thatanencixtesubstantlallyliKSMiiepiotein. 
„,toCLlOTALVilgori«mastaM^i«o«»MBOAUG^ 
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CLOSTAL V,te«Wtpara™=K. are set as Mows: l^t^i.^"^- 

p^c.ec«^a>*^.*ep«c««.^.yisrep<^*yCLUSTALV,s«.'*<x^ 

5 betweoiJdignedpolypejJtide sequence pairs. 

Altemativdy theNCBI BLAST software suite may be. s«i For example, for apa^ 
oo.^ansonaftwopclypeptiaese^e^.OBemayuse^e"BI^T2Se.pen^^^ 
(Apnl-21.2000)withblaslpsetatdefaultparame.as. Sucb default parameters may be. for example 

Matrix BWSVM62 
10 Opera Gap: ii on^ Extension Gap: 1 penalties 

Gap X drop-off: 50 
Expect: 10 
Word Size: 3 

Filter: on w 

^ over -.lengO-of a6a^«^<«n»...g». ^V^y^^ 
a»ag„,e„.ota.leas.l5,a..eas.20..tleas.30.at,eas,40.a.,eas.50.a.le.st70cr«..^50 

oo^^ousre^^es. S.c.le^»e«a*y,.-it'^.»^<"-»»^'»^ 

desaibealenEmover>*ichp=rc««agei(Je«ilyii»flrtel«a«ared 

artillcial cbromosomes" (HACs) arelimar nmo^m^ «u<J.inay c<«a>n 
DNA seque^e. Of about 6 icb 10 Mb in Size aM c««ain an =t fl» de.n»« «^ 

clirMiiosome rtjilicatioii, segr^ation and maintenance. 
j5 Tle.erm-««danabodrreters.oanan.ibodyn«*c^in«>«.t.»»nb^ 

,e,^in,beno,..n.igenbi»iinj regions has been a»eredso.b««»»<ibo* 

resaiiBes a iMoan aMbody, and slin retaim its original binding aM^^ 

.B,bridlz.tion"ret«sto.l.eprocessby«l>icliapolynucleottdes.r.«la,»«ls«^ 

.^Sto^tarystraridSwagbbasepairir^n^^ 
30 ^^.sani»«c«lc»r.«^.wo.«^aoidse<pe.«es^aMg.degreeofcom^^^ 

specific b,b.W-.lonoon^fo.mtn*rp»nlssivean«aling conditions andre^ 

atoBe-wasUnrsttfCs). •Ile™^s.ep(a)ispardc*lyin.pcr»'-«--^«-~=' 
„fthel«bndizaaonproc«s.«im««strin^condiBonsaIto^ 

tWtogbet«=eapalr,o(Mcleio.cids,..»fct.«..re.««p«fecayn«*=i Pexmi^ive conditions «or 
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amealiiig Of middc add sequences areroutimdy detennimbleby one erf ordinary skill intiie art and 
may be consistesnt among hybridization experiments, \<*ereas wash conditions may be varied among 
experimoitsto achieve the desired stringency, and therefore hybridization specificity. Pennissive 
annealing conditions occur, for example, at eS^C intiie presence of about 6 x SSC, about 1% (w/v) 
SDS, and about 100 |ig/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to flietemperatnre 
nnder which tiie wash step is carried out Sudi wash temperatures are typically selected to be ablout 
■ 5**C to 2(fC lower than tiie tiiermal melting point (T J for tiie spedfic sequence at a de^ed ionic 
strength and pH. The T„ is tiietranperature (under defined ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfecfly matdied probe. An equation for calculating T„ and conditions 
for nucldc add hybridization are wdl known and can be found in Sambrook. J. et al. (1989) Molecular 
nmingr A LahnratoTv ManuaL 2^ ed.. vol. 1-3, Cdd Spring Harbor Press; Plainview NY; specifically 

see volume 2, diapter 9. 

High stringffljcy conditions for hybridization between polynudeotides of tiie present invention 
mclnde wash condifions of 68°Cmtiie presence of about 0.2x SSC and about 0.1% SDS, for 1 hour. 
Mtemativdy.toiqieratures of about 65°C, 60»C.55°C,or42°Cniaybeused. SSC concentration may 
be varied from about 0.1 to 2 x SSC, witti SDS being present at about 0.1%. Typically, bloddng 
reagents are used to block non-spedfic hybridization. Such blocking reagraits include, for instance, 
sheared and denatured sahnon sperm DNA at about 100-200 jig/mL Organic solvent, sudi as 
formamide at a concentration of about 35-50% v/v. may also be used under particular circumstances, 
such as for RNA:DNA hybridizations. Usrful variations on tiiese wash conditions will be readily 
apparent to tiiose of ordhiary skill in tiie art Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary shnilarity between tiie nudeotides. Such shnilarity is 
stiongly Indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a conrplex formed between two nucldc add 
sequences by virtue of tiie formation of hydrogm bonds between complranentary bases. A hybridization 
conqilex may be formed in solution (e.g., Cot or Rot analysis) or formed between one nuddc add 
sequoice present in solution and anotiiear nucldc acid sequence immobilized on a solid support (e.g., 
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells 
or thdt nuddc adds have been fixed). 

The words "msertion" and "addition" refer to dianges in an amino add or nudeotide sequaice 
resulting in the additim of one or more amino add residues or nudeotides, respectivdy. 

"Iraraune response" can refo: to conditions assodated witii inflammation, tiauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of 
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various factors, e.g., cytokines, chemoMnes, and otber signaling mcdecales, which may affect ceSlular 
and systemic define systems. 

An "immunog^c fragment" is a polypeptide or oligopeptide fragment of TRICH vrtiich is 
capable of diciting an immmie response when introduced into a.living organism, for exanqfle, a 
5 mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragmiKDt of 
TRICH which is useM in any of the antibody production methods disclosed herdn or known in the art. 

The term "microarray*' refers to an arrangement of a plurality of polynucleotides, polypeptides, 
or other chemical compounds on a substrate. 

The tenns "element" and "array elemmt" refer to a polynucleotide, polypq)tide, or other 
10 dieoaical compound having a unique and defined position on a microarray. 

The tmn "modulate" refers to a change in the activity of TRICH, For exan5)le, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 
functional, or innnsnnological properties of TRICH. 

The phrases **nucleic acid" and "nuddc add sequaice" refer to a nucleotide, oligonucleotide, 
15 polynudeotide, or any fragment thereof. These phrases also refa to DNA or RNA of genomic or 
synthdic origin which may be single-stranded or doubl&-stranded and may represent the sense or the 
antisense strand, to peptide nucldc add (PNA), or to any DNA-lite or RNA-like material. 

"Oporably linked" refa:^ to the situation in which a first nucldc acid sequence is placed in a 
functional relationship with a second nucleic add sequence. For instance, a promoter is operably 
20 linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Qperably linked DNA sequ^ces may be in close proximity or contiguous and, where 
necessary to join two protdn coding regions, in the same reading frame. 

•Teptide nucldc add" (PNA) refers to an antiseose molecule or anti-gme agent which 
comprises an oligonucleotide of at least about 5 nucleotides in Imgth linked to a pqptide backbone of 
25 amino add residues ending in lysine. The terminal lysine confers solubili^ to the composition, PNAs 
prefer^ally bind conqdemmtary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to ext^d their lifespan in the cell. 

"Post-translational modification" of an TRICH may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and oth^ modifications known in the 
30 art. Theseprocessesmayoccur synthetically or biodiemically. Biochemical modifications will vary by 
cdl type depending on the enzymatic milieu of TRICH, 

"Probe" refers to nucleic acid sequences encoding TRICH, their complements, or fragments 
thereof, whidi are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynudeotides attadied to a detectable labd or reportea: molecule. Typical 
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labds indude radioactive isotq)es, ligands, diemnrnirinescatt agents, and eaizynies. "PrimHis" are 
short nucleic adds, usually DNA oligonudeotides, wbidi may be annealed to a target pdynncleotidfe by 
con^lementary base-pairing. The primer may then be extaided along the target DNA strand by a DNA 
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nuddc add 

5 sequence, e.g., by the polymerase chain reaction (PGR). 

Probes and prime's as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance spedfidty, longer probes and primers may also 
be enqfloyed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90,i 100, 
or at least 150 consecutive nudeotides of the disdosed nuddc add sequences. Probes and primers may 

10 be considerably longer than these examples, and it is understood that any length suppcffted by the 
specification, induding the tables, figures, and Sequence Listing, may be used. 

Mdhods for prqpariiig and using probes and primers are described in the references, for 
exanqjle Samhrook, J. et al. (1989) Molecular ao ning: A Laboratorv Manual, 2™" ed, vol. 1-3, Cdd 
Spring Harbor Press, Hainview NY; Ansubd, F.M. et aL (1987) Cutreqt Protocols in Molecular 

15 Bidogy. Greene PubL Assoc. & Wiley-Intersciences, New York NY; Lnnis, M. et aL (1990) PGR 
Protocols. A Guide to Method's and Applications. Academic Press, San Diego CA. PGR primer pairs 
can be dalved flrom a known sequaice, for exmspls, by ushig con?)uter programs intended f(x that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Researdi, Gamhridge 
MA).. 

20 Oligonucleotides for use as prhners are sdected usuig software known in the art for such 

purpose. For exanqjle, OLIGO 4.06 software is usefial for the selection of PGR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonudeotides and larger pdynndeotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Shnilar primer sdection 
programs have incorporated additional features for expanded capabilities. For exanple, tiie PrimOU 

25 primer sdection program (available to the pubUc from the Genome Genter at University of Texas Soufli 
West Medical Center, Dallas TX) is capable of dioosmg specific primers from megabase sequences 
and is thus useful for designing primers on a genome-wide scope. The PrimerS primer sdection 
program (available to ttie public from the Whitehead Institute/MIT Genter for Genome Research, 
GambridgeMA) allows the user to input a "mispriming library," in which sequences to avoid as pruner 

30 bindmg sites are user-specified. PrimerS is useful, in particular, for the sdection of oligonucleotides for 
microarrays. (The source code for the latter two primer sdection programs may also be obtained from 
tiieir respective sources and modified to meet the user's spedfic needs.) The PrimeGea program 
(available to the puWic from tiie UK Human Genome Mapping Project Resource Centte. Cambridge 
UK) designs primers based on multiple sequence alignments, thereby allowing sdection of primers that 
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hyWdize to eiflxerthemostcoiiservedor least ccfflservedregion^ 

H^ce. this program is useful for identificatioa of boQi lurique and conserved digonucleottdes and 
polynucleotide fragments. The oligonucleotides and polynucleotide fragm^ identified by any of tbe 
above sdectionmethodsareusefulinhybridizationtecMologies, for example, as PGR or 

5 primers, microairay dem«its. or specific probes to identify fully or parttaHy complementary 

pdymdeotidesinasampleofnudeicacids. Methods of oligonucleotide selection are not limited to 

those described above. 

A ••recombinant nuddc add" is a sequence tiiat is not natinrally occurring or has a sequence 
ttot is made by an artifidd comMnation of two or more otherwise s^arated segm^te 
10 ThisartifidalcombinationisoflmaccompMjedbydjenucalsynthesisor^ 

artifidal manipulation of isolated segmente of naddc adds. ag.. by genetic engineering techniques 
. sudiasthosedescribedinSambrodk.suEra. The term recombinant indudes nuddc adds Ibat have 
been altered scddy by addition, substitution, or ddetionafaportionofti«enucldcadd.Fr«^^ 
recombinant maddc addmay include a nuddc add sequence operably linked to a promoter sequence. 
15 Sudiarecombinantnuddcaddmaybepartofavectorti^isused.foracample.totransforma^^ 

Mtemativdy, sudi recomWnant nuddc adds may be part of a viral vedor. e. g. . based on a 
vaodma virus, tiiat cooldbeuseto vacdnate a mammal whecdnlherecombimuit nu^^^ 
expressed, indudng a protective immnnDlogical response inihe mammal. 

A "regulatory elem^' refers to a nuddc add sequence usually derived fl-om mitranslated 
20 regions of a gene and includes enhancers, promoters, intxons. andS' and 3' untranslated regions (UTRs). 
Regulatory dements interact witti host or viral protdns whidi control transcr^on, translation, or RNA 
stability. 

"Reporter molecules" are dianical or biodiemical moieties used for labding a nuddc aad, 
amino add, or antibody. Rq)orter molecules include radionuclides; enzymes; fluorescent, 
25 diemilmninescent. or chromogenic agents; substrates; cofactors; inhibitors; magnetic partides; and 

other moieties known in the art 

An "RNA equivalent," in reference to a DNA sequence, is composed of ti^e same linear 

sequaice of mideotides as tiie reference DN A sequence witii ti^e exception lhat an occurrences 
nitrogenous base thymine are rq,lacedwithm:adl. and tiie sugar bad*oneiscon5)osed of rib^^ 

30 instead of decHcyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing TRICH, 
«xddc adds encoding TRICH. or fragments thaeof may comprise a bodily fluid; an extract from a 
cdl. diromosome. organdie, or mmbrane isolated from a cdl; a cdl; genomic DNA. RNA. or cDNA. 
m solution or bound to a substrate; a tissue; a tissue print; dc. 
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The terms "spedfic binding" and "spedfically Wnjfing" refer to liiat interactioa betweai a 
protean or peptide and an'agonist, an antibody, an antagonist, a small molecule, or any natural or ' 
synfljetic binding compositioa The interaction is dependKitiq)on1hepreseiK» of a partioflarstnicture 
of the protdn, e.g., tiie antigenic determinant or qdtope, recognized by the binding molecule. For 
5 exan5)le, if an antibody is specific for epitope "A," me presence of a polypqrtide comprising the epitope 
A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will 
reduce tiie amount of labeled A that binds to the antibody. I 
The term "substantially purified" refers to nucleic add or amino acid sequaices that are! 
removed from their natural oiviromnMit and are isolated or separated, and are at least 60% free, 
10 preferably at least 75% free, and most preferably at least 90% free from other components with which 
they are naturally associated 

A "substitutiOT." refers to the replacanrat of one or mxe amhio add residues or nudeotides by 
different amino add residues or nudeotides, respectivdy. 

"Substratje" rrfers to any suitable rigid or sanl-rigid siqjport including membranes, filters, 
15 daps, sUdes, wafers, fibers, magnetic or nonmagnetic beads, gds, tubing, plates, polymers, 
mlCTOpartides and cqallaries. The substrate can have a variety of surface forms, sudi as wdls, 
trendies, pins, dianneas and pores, to which pdynudeotides or polypeptides are bound 

A "ttanscript image" refers to the collective pattan of gaie ejcpressionby a particular cell type 
or tissue muter ^ean conditions at agiventime. 
20 "Ttansformatioa" desalbes a process by whidi ex(^eaKJUS DNA is introduced into a redpient 

cdL Transfonnation may occur under natural or artifidal conditions according to various methods wcffl 
known m the art, and may rdy on any known method for the hisertion of fordgn nuddc add sequaices 
intoaprokaryoticoreakaryotichostcdL The method for transfonnation is sdected based on the type 
of host cdl bdng transformed and may mdude, but is not limited to, bacteriophage or viral infection, 
25 electrqporation, heat shock, Upofection, and particle bombardment. The term 'transformed cells" 
includes stably transformed cdls in which the inserted DNA is capable of repKcation either as an 
autonomously replicating plasmid or as part of the host diromosome, as weill as transientiy transformed 
ceils which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 
30 animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques wdl known in the 
art The nuddc add is introduced into the cdl, directly or mdirectly by introduction into a precursor 
of the cdl, by way of ddiberate gtmetic manipulation, such as by miaoinjection or by infection witii 
a recombinant vhns. The tenn genetic manipulation does not hiclude dassical ctoss-breedmg, or in 
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vino fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms conten?)lated in accordance witlithe present inveidiaa indude bacteria, 
cyaaobacteria, fungi, plants and animals. The isolated DNA of the present inventiori can be 
introduced into the host by methods known in the art. for example infection, transfection, 
transformation or transconjugation. Techniques for transferring tiie DNA of the present invention 
into such organisms are widely known and provided in references such as Sambrook et aL (1989). 
supra . 

A "variant" of a particular nucleic add sequence is defined as a nuddc add sequence having at 
least 40% sequence identity to the particular nucldc acid sequence over a certain length of one of flie 
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 85%. at least 90%. at least 95% or at least 98% or greater 
sequence identity over a certain defined length. A variant may be described as, for example, an "alldic" 
(as defined above), "splice," "spedes." or "polymorphic" variant. A spUce variant may have significant 
identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides 
due to alternative splicing of exons during mRNAprocessmg. The cotresponding polypeptide may 
possess additional functional domains or lack domains that are present in the reference molecule. 
Spedes variants are polynudeodde sequences that vary from one spedes to another. The resulting 
polypeptides will generafly have significant amino add idaitity rdativeto eadi other. Apolymorphic 
variant is a variation in the pcflynudeotide sequence of a particular gene between hidividuals of a given 
spedes. Polymorphic variants also may eoconpass "smgle nudeotide polymorphisms" (SNPs) in 
whidilhepolynndeotide sequence varies by (me nudeotide base. The presence of SNPs maybe 
hidicative of, for exan9)le, a certain population, a disease state, or a propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence havhig at 
least 40% sequence identity to the particular polypeptide sequence over a certain length of one of tiie 
polypeptide sequaices using blastp with tiie "BLAST 2 Sequences" tool Version 2.0.9 (May-Or-1999) 
set at default parameters. Such apair of polypeptides may show, for example, at least 50%. at least 
60%, at least 70%, at least 80%, at least 90%, atleast 95%, or atleast98% or greater sequence 
identity over a certain defined lengtii of one of the polypq)tides. 

THE INVENTION 

The invention is based on tiie discovery of new human transporters and ion diannds (TRICH), 
thepolynudeotides encoding TRICH, and tiie use of these compositions for ttie diagnosis, tieatinent, or 
prevention of ti-ansport, neurological, musde, and unmunological disorders. 
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Table 1 summarizes the nommclature for the full leagUi polynucleotide and polypeptide 
sequeaices of the inventioa Each polynucleotide and its corresponding polypeptide are corrdated to a 
single Incyte project identification number (Incyte Project ID). Each polypq)tide sequ^ice is denoted 
by botb a polypeptide sequence idmtification numbea: (Polypeptide SEQ E) NO:) and an Incyte 

5 polypeptide sequcaicenmriba: (Incyte Polypeptide ID) as sto Each polynucleotide sequence is 
denoted by both a polynucleotide sequence idmtification number (Polynucleotide SEQ ID NO:) and an 
Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as shown. | 

Table 2 shows sequences with homology to the polypeptides of the invention as identified by 
BLAST analysis against the GenBank protein (genpept) database. Columns 1 and 2 show the 

10 polypeptide sequence identification numbea: (Polypeptide SEQ ID NO:) and the corresponding Incyte 
polypeptide sequence number (Incyte Polypeptide ID) for each polypq)tide of the invention. Column 3 
shows the GenBank identification numbo: (Genbank ID NO:) of the nearest GaiBank homolog. 
Column 4 shows the probability score for tiie match between each polypeptide and its GenBank 
homolog. Column 5 shows the annotation of the GenBank homolog along with relevant citations where 

15 applicable^ all of which are expressly incorporated by refer^ice herdn. 

Table 3 shows various structural features of each of tlie polypeptides of the invention. Columns 
1 and 2 show the polypq)tide sequence identification mwbea: (SEQ ID NO:) and tiie corresponding 
Incyte polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invoitioa 
Column 3 shows the nmriber of amino add residues in each polypeptide. Column 4 shows potential 

20 phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the MOTIFS 
program of the GCG sequaice analysis software package (Gmi^cs Conq?utar Group, Madison WI). 
Column 6 shows amiao add residues comprising signature sequences, domains, and motifs. Column 7 
shows analytical mdhods for protein structure/fimction analysis and in some cases, searchable 
databases to whidi the analytical methods were applied. 

25 As shown in Table 4. the fiill lengfli polynucleotide sequeaces of the present hrvention were 

assembled using cDNA sequences or coding (exon) sequences derived from genomic DN A, dr any 
combmation of these two types of sequences. Columns 1 and 2 list the polynucleotide sequence 
identification number (Polynucleotide SEQ ID NO:) and flie corresponding Incyte polynudeottde 
consensus sequence number (Incyte Polynucleotide ID) for each polynucleotide of the mvention. 

30 Column 3 shows tiie length of each polynucleotide sequence m basepairs. Colunan 4 lists fragments of 
the polynucleotide sequences which are useful, for example, in hybridization or amplification 
technologies tiiat identify SEQ ID NO:28-54 or tiiat distinguish between SEQ ID NO:28-54 and 
related polynucleotide sequences. Column 5 shows identification numbers corresponding to cDNA 
sequences, coding sequences (exons) predicted from genomic DNA, and/or sequence assanblages 
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comprised of both cDNA and genomic DNA. These sequences wereused to assemble Ihe full length 
polynucleotide sequaices of Ihe invention. Cdtamns 6 and7 of Table 4 show the inideotide start (50 
and stop (30 positions of the cDNA and genomic sequaices in column 5 relative to their respective full 
length sequences. 

The identification numbers in Column 5 of Table 4 may lefar spedfically, for example, to 
Incyte cDNAs along with their corresponding cDNA libraries. For exanq)le, 6813453H1 is the 
identification number of an Incyte cDNA sequence, and ADRETUROl is the cDNA library firom wMdi 
it is derived. Incyte cDNAs for which cDNA Ubraries are not indicated were derived from pooled 
cDNAUhraries (e.g., 70207988V1). Alternatively, the identification numbers in column 5 may refer to 
GenBank cDNAs or ESTs (e.g., gl947104) which contributed to the assembly of the full length 
pdynncleotide sequences. Alternativdy, the identification numbers in column 5 may refer to coding 
regions predicted by Genscan analysis of genomic DNA For exanqjle. GNN.g6554406_006 is the 
identification number of a Genscan-predicted coding sequence, vwth g6554406 being the GeiiBank 
identification number of the sequaice to which Genscan was applied. The Genscanrpredicted coding 
sequencesmayhavebeeneditedptiortoassembly. (SeeExan5)leIV.) Altemativdy. the identification 
numbers in column 5 may refer to assrariblages of both cDNA and Genscan-predicted exons brought 
together by an "econ stitching" algorithm. (See Example V.) AMemativdy. the identificatiQn numbers 
in column 5 may tefa to assemblages of both cDNA and Genscan-predicted exons brought together by 
an "exon-stretchmg" algorithm. (SeeExan?)leV.) In some cases. Incyte cDNA coverage redundant 
vwththesequencecoverageshovmincdumnS was obtainedto confirm the final consaisus 
polynucleotide sequence, but the relevant hicyte cDNA identification numbers are not showa 

Table 5 shows the rq)resentafive cDNA Ubraries for those fiill length polynucleotide sequences 
which were assembled using Incyte cDNA sequences. The rq«resentative cDNA library is the Incyte 
cDNA Ubrary which is most frequenfly represented by the Incyte cDNA sequences which were used to 
assanble and confirm the above polynucleotide sequences. Thetissues and vectors which were used to 
construct the cDNA libraries shown m Table 5 are described in Table 6. 

The invention also encompasses TRICH variants. A preferred TRICH variant is one whichhas 
at least about 80%, or alternatively at least about 90%, or even at least about 95% amino add sequence 
identity to the TRICH amino add sequence, and whidi contains at least one functional or structural 

characteristic of TRICH. 

The invention also encompasses polynucleotides whidi encode TRICH. In a particular 
embodiment, the hxvention encompasses a pdynudeotide sequence comprising a sequence sdected firom 
the group consisting of SEQ ID NO:28-54, whidi encodes TRICH. ThepcAynudeotide sequences of 
SEQ ID NO:28-54, as presented hi tiie Sequence Listing, aribrace tiie equivalent RNA sequences, 
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wlieardn occurrmces of the mtrogenous base fbymine are replaced witli uracil, and fbe sugar l)ac]dx}ne 
iscon^osedof riboseiiisteadaf deoxyribose. ' 

Tbe invention also encompasses a variant of a polynucleotide sequence encoding TRICH. In 
particular, such a variant polynucleotide sequence will have at least about 70%, or altemativdy at least 

5 about 85%, or even at least about 95% polynucleotide sequence id^tity to tte polynucleotide sequence 
encoding TRICH. A particular aspect of the inv^tion mcon5)asses a variant of a polynucleotide 
sequence comprising a sequence selected from the group consisting of SEQ ID NO:28-54 v^hich has at 
least about 70%, or alternatively at least about 85%, or even at least about 95% polynucleotide ' 
sequence identity to a nucleic acid sequence selected from tiie group consisting of SEQ ID NO:28-54. 

10 Any one of the polynucleotide variants described above can encode an aniino add sequmce A^ch 
contains at least one functional or structural characteristic of TRICH, 

It win be appreciated by those sldUed in the art that as a result of the degenCT^ 
code, a multitude of polynucleotide sequences encoding TRICH, some bearing minimal similarity to the 
polynucleotide sequences of any known and naturaUy occurring gene, may be produced. Thus, the 

15 invention contraiplates each and every possible variation of polynucleotide sequence that could be made 
by sdecting combinations based on possible codon choices. These combinations are made in 
accordance with the standard triple genetic code as applied to the polynucleotide sequence of naturally 
occurring TRICH, and all such variations are to be considered as bdng spedficaUy disclosed. 

Although nucleotide sequences which encode TRICH and its variants are generally capable of 

20 hybridizing to the nucleotide sequence of the naturally occurring TRICH undo: appr(5)riatdy sdected 
conditions of stringency, it may be advantageous to produce nucleotide sequeaces encoding TRICH or 
its derivatives possessing a substantially differmt codon usage, e.g., inclusion of non-naturalLy 
occurring codons. Codons may be selected to inaease the rate at which expression of the peptide 
occurs in a particular prokaryotic or eukaryotic host in acccardance with the frequency with whidbt 

25 particular codons are utilized by the host. Other reasons for substantially altering the nucleotide 
sequence encoding TRICH and its derivatives without altering the encoded amnio add sequences 
include the production of RNA transcripts having more desirable properties, such as a greata* half-life, 
than transcripts produced from the naturally occurring sequ^ce. 

The invention also encompasses production of DNA sequences which encode TRICH and 

30 TRICH derivatives, or fragments thereof, entirely by synthetic chemistry. Aft^ production, the 

synthetic sequence may be inserted into any of the many available expression vectors and cell systems 
using reagmts wdl known in the art Moreovo*, synthetic chranistry may be used to introduce 
mutations into a sequaice aicoding TRICH or any fragmait thereof. 

Also encon^assed by the invention are polynucleotide sequences that are enable of 
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hybridizing to the claimed polynucleotide sequcoces, and, in particular, to those shown in SEQ ID 
NO:28-54 and fragments thereof undo: various conditions of stringency. (See, e.g., WaM, GM. and 
SX. Berger (1987) Methods EnzymoL 152:399-407; Khrnnd, A.R. (1987) Methods Enzymol. 
152:507-511.) Hybridization conditions, including annealing and \yash conditians, are described in 
5 "Definitions." 

Methods for DNA sequencing are well known hi the art and may be used to practice any of the 
embodiments of the uivention. The methods may employ such enzymes as the Kleuow fragment of 
DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (PE 
Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 

10 combinations of polymerases and proofreadmg exonucleases such as those found m the ELONGASE 
amplification system (Life Technologies, Gaithersbnrg MD). Preferably, sequence prq)aration is 
automated with machines such as the MICROLAB 2200 hquid transfer system (Hamilton, Reno NV), 
PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (PE 
Biosystems). Sequeodng is then carried out usmg either the ABI 373 or 377 DNA sequencing system 

15 (PE Biosystems), the MEGAB ACE 1000 DNA sequencmg system (Molecular Dynamics, Sunnyvale 
C A), or other systems known in the art The resulting sequmces are analyzed usmg a variety of 
algorithms which are wen knownin the art. (See, e.g., Ausubd, RM. (1997) Short Protocols m 
Molecular Biology. John Wiley & Sons, New York NY, unit 7.7; Meyers. R. A (1995) Molecular 
Biology and p intftnhnnlo pv. Wiley VCH, New York NY, pp. 856-853.) 

20 The nudeic acid sequences encoding TRICH may be e3rtended utihzing a partial nucleotide 

sequence and engdoying various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory demmts. For sample, one method which may be anployed, 
restriction-site PGR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA withm a clomng vector. (See, e.g., Sarkar, G. (1993) PGR Methods Applic. 2:318-322.) 

25 Another method, hiverse PGR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprismg a 
known genomic locus and surrounding sequences. (See, e.g., TrigUa, T. et al. (1988) Nucldc Adds 
Res. 16:8186.) Athird method, capture PGR, involves PGR amplification of DNAfragmmts adjacent 
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 

30 (1991) PGR Methods Apphc. 1:111-119.) hi this method, multiple restriction ^yme digestions and 
ligations may be used to insert an engmeered double-stranded sequaice mto a region of unknown 
sequence before performing PGR. Other methods which may be used to retrieve unknown sequences 
areknownin the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). 
Additionally, one may use PGR, nested primers, and PROMOTERFINDER Ubraries (Glontech, Palo 
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Alto C A) to walk genomic DNA, This porocedure avoids the need to saeea libraries and is useful in 
finding intron/exon junctions. For an PCR-based methods, primers may be designed using ' 
commercially available software* such as OLIGO 4.06 primer analysis software (National Biosd^ices, 
Plymouth MN) or another ^propriate program, to be about 22 to 30 nucleotides in length, to have a 
5 GC content of about 50% or more, and to anneal to the template at toiqperatures of about 68°C to 
72°C, 

When screening for fuH length cDNAs, it is preferable to use libraries that have been ( 
size-selected to include larger cDNAs. In addition, random-primed libraries, which oftm include 
sequ^es containing the 5' regions of genes, are preferable for situations m which an oligo d(T) library 

10 does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5' 
non-transaibed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze the 
size or confirm the nucleotide sequence of sequencmg or PGR products. In particular, capillary 
sequencing may CTiploy flowable polymers for electrophoretic separation, four different nucleotide- 

15 specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light int^ity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, PE Biosysteoos), and ^ entire process 
fi-om loading of samples to coin)uter analysis and dectronic data display may be con^^uter controlled. 
Cq)inary dectrophoresis is especially pref arable for sequencing small DNA fragments whidi may be 

20 present m limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or ft agmmts thereof which 
encode TRICH may be doned in recombinant DNA molecules that direct egression of TRICH, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of 
the gen^c code, oflier DNA sequences whidi mxxie substantially the same or a functionally equivalent 

25 amino add sequence may be produced and used to escpress TRICH, 

The nucleotide sequraces of the present invention can be engineered using methods generally 
known in the art in order to alto: TRICH-mcoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processmg, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PGR reassembly of gme fragments and synthetic 

30 oligonucleotides may be used to engmeer the nucleotide sequences. For example, oligonucleotide- 

mediated site-directed mutagenesis may be used to introduce mutations that aeate new restriction sites, 
alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Numba: 
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5.837,458; Chang. C.-C. et al. (1999) NaL Biotechnol. 17:793-797; Christians, F.C. et aL (1999) NaL 
Biotechnol. 17:259-264; and Crameii, A. et al. (1996) Nat BiotechnoL 14:315-319) to alter or 
improve the biological properties of TRICH, such as its biological or enzymatic activity or its ability 
to bind to other molecules or compounds. DNAshufQingisaprocessby which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. THeUbrary isthsn 
subjected to selection or screening procedures that identify those gene variants wifli the desired 
properties. THese preferred varianis may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. THus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
pomt mutations may be recombined, screened, and then reshuffled untfl the desired properties arc 
optimized. Altexnatively. fragments of a given gene may be recombmed witii fragments of 
homologous genes m the same gene family, either from the same or different species, thereby 
maximizmg the genetic diversity of multiple naturaUy occurring genes in a directed and controllable 
manner. 

In another embodhnent, sequences encoding TRICH may be synthesized, in whole or m part, 
using chemical methods wdl knowninthe art (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Adds 
Syrap. Ser. 7:215-223; and Horn, T. et al. (1980) Nuddc Adds Symp. Ser. 7:225-232.) Altemativdy. 
TRICH itsdf or a fragment thereof may be synthesized using chemical methods. For exanqde, peptide 
synlhesis can be perfcmned using various solution-phase or solid-ifliasetedmiques. (See, e.g., 
Crdghton, T. (1984) T>rfrfrim Stnich irfts and Molecular Properties. WH Freeman, New YorkNY, 
pp.55-60; and Roberge. J.Y. et aL (1995) Sdence 269:202-204.) Automated synthesis may be 
adieved using the ABI 431Apq)tide synthesizer (PE Biosystems). Additionally, tiie amino add 
sequence of TRICH. or any part tiiereof, may be altered during direct synthesis and/or combined with 
sequences from other protons, or any part tiiereof. to produce a variant polypeptide or a polypq)tide 
having a sequaice of a naturally occurring polypeptide. 

The peptide may be substantially purified by prq)arative high performance liquid 
chromatography. (See, e.g., Chiez, RJVI. and F.Z. Regnier (1990) MdhodsEnzymdL 182:392-421.) 
The composition of tiie synthetic peptides may be confirmed by amino add analysis or by sequendng. 
(See, e.g., Crdghton, supra , pp. 28-53.) 

In order to express a biologicany active TRICH. tiie nucleotide sequences encoding TRICH or 
derivatives tiiereof may be inserted into an appropriate expression vector, i.e., a vector ^di contains 
tiie necessary elmients for transcriptional and translational control of tiie inserted codmg sequaice in a 
suitable host These demaits include regulatory sequences, sudi as enhancers, constitiitive and 
hidudble promoters. and5'and3'untranslated regions in tiie vector and in polynudeotide sequences 
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encoding TRICH. Siich danents my vary in their straigthaiKispecifid Specific iMtiation signals 
may also be used to achieve more efficient translation of sequoices encoding TRICR Sudi signals ' 
include the ATG initiation codon and adjacent sequences, e.g.ttieKozaJcseqa{aice. Incases where 
sequences encoding TRICH and its initiation codon and upstream regulatory sequences are inserted into 

5 the appropriate expression vector, no additional transcriptional or translational control signals may be 
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogaious 
translational conlrol signals including an in-frame ATG initiation codon should be provided by the| 
vector. Exogenoastranslationaldementsandinitiationcod6nsmaybeofvariousorigins,bothnatural 
andsynaelic. meffidencyof expression may be enhanced by the inclusion of enhancers appropriate 

10 for Iheparticular host ceQ syslemused. (See. e.g.. Sdiarf, D. et aL (1994) Results Probl. Cdl Differ. 
20:125-162.) 

Methods-wMdi are wdl known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding TRICH and appropriate transcriptional and translational control 
dements, iiiese methods indudeiiL^ recombinant DNA techniques, synthetic te^ 
15 23322 genetic recombination. (See. e.g., Sambrook, J. et aL (1989) MolmilBrnoninp, A Laboratory 
Manual. Cdd Spring Harbor Press. Hainview nV, di. 4. 8. and 16-17; Ausubd, FM. et aL (1995) 
nirr^nt Protocols <ti Molecular Biology . John Wiley & Sons. New York NY. dt 9, 13. and 16.) 

Avariety of expression vedor/host systems may be utilized to contain and express sequences 
encoding TRICH. Tliese indude. but are not limited to. microorganisms sudi as baderia transformed 
20 with recombinant baderiophage. plasmid. or cosmid DN A expression vedors ; yeast transformed with 
yeast expression vedors; insect cell systans infeded witH viral raqiression vedors (e.g., baculovirus); 
plant cell systems transformed with viral expression vedors (e.g.. cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or xvithbaderial expression vedors (e.g.. Ti or pBR322plasmids); or 
animal cdl systems. (See, e.g.. Sambrook. supra; AusubeL supra; Van Hed^e. G. and S.M. Sdiuster 
25 (1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K et al. (1994) Proc. Natl. Acad. Sd. USA 

91:3224-3227; Sandig, V. d al. (1996) Hum. GenelHer. 7:1937-1945; Takamatsu.N. (1987) EMBO 
J. 6:307-311; The. McGraw Hin Yparhnok of S n^r^r^. and Technology (1992) McGraw Hill. New 
Yori. NY. pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sd. USA 81:3655-3659; and 
Hanington,J.J.etal. (1997) Nat Genet 15:345-355.) Expression vedors derived from retrovfruses. 
30 adenoviruses, or herpes or vacdnia viruses, or from various baderial plasmids, may be used for 
ddivery of nndeotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di 
Nicola. M. d SI. (1998) Cancer Gen. Iher. 5(6):350-356; Yu, M. d al. (1993) Proc. Nati. Acad. ScL 
USA 90(13):634O-6344; Buller. R.M. et aL (1985) Nature 317(6040):813-815; McGregor. D.P. et al. 
(1994) Mol. ImniunoL 31(3):219-226; and Venna, IM. andN. Somia(1997) Nature 389:239-242.) 
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The inventiQa is not limited by the host cell enqfloyeA 

In bactaial systans, a number of cloning and expression vectors may be sdected depending 
upon the use intmded for polynucleotide sequaices encoding TfUCH. For example^ routine cloning, 
subcloning» and propagation of polynucleotide sequences encoding TRICH can be adbieved using a 

5 multifunctional E. coU vector such as PBLUESCRIPT (Stratag^e, La JoDa CA) or PSPORTl plasmid 
(Life Technologies). Ligation of sequences encoding TRICH into the vector's multiple cloning site 
disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of transformed 
bacteria containing recombinant molecules. In addition, these vectors may be usefiil for in vitro ! 
transcription, dideoxy sequencing, single strand rescue with heJper phage, and creation of nested 

10 deletions in ttie cloned sequence. (See, e.g.. Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 
264:5503-5509.) When large quantities of TRICH are needed, e.g. for the production of antibodies, 
vectors which direct high levd ejqpression of TRICH may be used. For exan5)le, vectors containing the 
strong, inducible SP6 or T7 bacteriophage promoter may be used. 

Yeast esqwession systems may be used for production of TRICH. A number of vectors 

15 containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Picbia pastoris . Iq addition, such 
vectCMTS direct eitha: the. secr^on or intracdlular retention of expressed protdns and enable integration 
of foreign sequences into the host g^ome for stable propagatioa (See, e.g., Ausubd, 1995, supra; 
Bitter, G.A et al. (1987) Methods Enzymol. 153:516-544; and Scorer, CA. et al. (1994) 

20 Bio/Technology 12:181-184.) 

Plant systsns may also be used for expression of TRICH. Transcription of sequences encoding 
TRICH may be driven by viral promotars, e.g., the 358 and 19S promota:s of CaMV used alone or in 
combination vrtfttiie omega leader sequmce from TMV(Taka^^ N. (1987)HSffiO J. 6:307-311). 
Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be 

25 used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; BrogUe, R. et al. (1984) Sdmoe 
224:838-843; and Winter, J. et al. (1991) Results Probl. Cdl Differ. 17:85-105.) These constructs can 
be introduced into plant cdls by direct DNA transformation or pathogen-mediated transfection. (See, 
e.g.. The McGraw Hih Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 
191-196.) 

30 In mammalian cdls, a number of viral-based expression systems may be utilized. In cases 

where an adenovirus is used as an e35)ression vector, sequences ^coding TRICH may be ligated into an 
admovirus traDsa:5)tion/translation coinpleat consistmg of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 r^cm of the viral gmome may be used to obtain 
infective virus which expresses TRICH m host cells. (See, e.g., Logan, J. and T, Shenk (1984) Proc. 
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Natl. Acad Sd. USA 81:3655-3659.) In addition, transcriptioii enhancers, such as toe Rous sarcoma 
virus (RSV) eiihancer, may be used to increase eapression in mammalian host cdls. SV40 or EBV- 
based vectors may also be used for high-levd protdn expressioa 

Human artificial chromosomes (HACs) may also be employed to deiliver larger fragments of 
5 DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivor methods (Uposomes, polycationic amino polymers, 
or vesicles) for therapeutic purposes. (See, e.g.. Harrington, J.J. et al. (1997) Nat Genet 15:345-355.) 

Far long tern production of reconibinant protons in mammalian systons, stable expression of 
TRICH in cdl lines is preferred. For enamplt, sequences aucoding TRICH can be transformed into cdl 
10 lines using expression vectors wMchmay contain viral origins of replication and/or endogenous 

ejq)ression dements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of tiie vector, cells may be allowed to grow for about 1 to 2 days in enriched media brfore 
bdngswitdied to sdective media. The purpose of the sdectable marker is to confer resistance to a 
sdective agent, and its presence allows growth and recovery of cdls viMdi successfully eatpress the 
15 introduced sequences. Resistant clones of stably transformed cdls may be propagated using tissue 
culture tedmiques appropriate to the cdl type. 

Any number of sdection systems may be used to recover transformed cdl lines. These include, 
but are not limited to, flie herpes simplex virus iliymidine kinase and adenine phosphoribosyltransferase 
genes, for use in tftr and apr cdls. respecttvdy. (See, e.g.. Wi^, M. et aL (1977) Cdl 1 1 :223-232; 
20 Lowy.I. etal. (1980) Ceil 22:817-823.) Also, antimetabolite, antibiotic, or herbidde resistance can be 
used as the basis for selection For example, dJfi- confers resistanceto methotrexate; neo confers 
resistance to tiie aminoglycosides neomycin and G-418; and flZs andpof confer resistance to 
cMorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., Wigler, M. et 0.. (1980) 
Proc. Nati. Acad. Sci. USA 77:3567-3570; Colber^Garapin, F. et al. (1981) J. Md. Bid. 150:1-14.) 
25 Additional selectable genes have been described, e.g., trpB and hisD, whidi alter cdlular requirements 
for metabolites. (See, e.g., Hartman, S.C. and R.C. MuDigan (1988) Proc. Natl. Acad. Sci. USA 
85:8047-8051.) Visible markers. e.g., anthocyanins, green fiuoresc^t protdns (GFP; Qontech), B 
glucuronidase and its substrate B-glucuronide. or luciferase and its substrate luciferin may be used. 
These markers can be used not only to identify transformants, but also to quantity flie amount of 
30 fransient or stable protdn raqpression attributable to a specific vedxir system (See, e.g., Rhodes, C.A. 
(1995) Mdhods MoL Bid. 55:121-131.) 

Although the presence/absence of marker gea& etpression suggests tiiat tiie gene of interest is 
alsopresent, tiiepresenceand expression of flie gene may needtobeconfirmed. For example, if tiie 
sequence encodtog TRICH is inserted witinn a marker gene sequoice. transformed cdls containhig 
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sequeDces aicoding TRICH can be identified by the absence of markea: gme function. Altonativdy, a 
marker geae can be placed intandmi with a sequaice eaocoding TRICH under Ibe OMJtrol of a single 
promote:. Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem g^ as iveU. 

5 In general, host cdls that contain the nucleic add sequence encoding TRICH and that eapress 

TRICH may be identified by a variety of procedures laiown to those of skill in the art Tliese 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR | 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
chip based technologies fOT the detection and/or quantification of nucleic acid or protdn sequences. 

10 Immunological methods for detecting and measuring the expression of TRICH using either 

specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include 
aizyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence 
activated cdl sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-intof ering q)itqpes on TRICH is prefocred, but a competitive bmding 

15 assay may be employed. These and otlMar assays are wdl known m the art (See, e.g., Hampton, R. et 
al. (1990) Serological Methods, a Laboratorv ManuaL APS Press, St Paul MN, Sect IV; Coligan, J.E, 
et al. fl99T> Current Protocols in Tmmunolopv , Greene Pub. Associates and Wiley-Interscience, New 
York NY; and Pound, J.D. (1998) Immunochcanical Protocols. Humana Press, Totowa NJ.). 

A wide vari^y of labels and conjugation techniques are known by those skilled in the art and 

20 may be used in various nuclac add and amino acid assays. Means for produdng labeled hybridization 
or PCR probes for detecting sequences rdated to polynucleotides encoding TRICH include 
oligolabeling, nick translation, end-labding, or PCR amplification using a labeled nucleotide. 
Alternatively, the sequences encoding TRICH, or any firagments thereof, may be cloned into a vector for 
the production of an mRNA probe. Such vectors are known in the art, are commercially availably and 

25 may be used to syntiiesize RNA probes in vitro by addition of an appropriate RNA polymerase such as 
T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of 
commercially available kits, such as those provided by Amersham Pharmacia Biotedi, Prom^a 
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for ease 
of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as 

30 well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cdls transformed with nucleotide sequences encodmg TRICH may be cultured unda 
conditions suitable for the expression and recovery of the protein from ceil culture. The protein 
produced by a transformed ceil may be secreted or retained intraceJlularly dq)ending on the sequence 
and/or the vector used. As will be understood by those of skill m the art, expressim vectors containing 
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polynucleotides which encode TRICH may be designed to contain signal sequences vAasAx direct 
secretion of TRICH through a prokaryotic or eukaryotic cdl membrane. 

In addition, a host cdl slrain may be chosen for its ability to modulate raqjression <rf the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the 

5 polypeptide include, but are not limited to. acetylation, carboxylalion, glycosylation, phosphorylatian, 
lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the 
protein may also be used to specify protean targeting, foldmg, and/or activity. Different host cells 
which have spedfic cellular machinery and charactmstic mechanisms for post-translational activities 
(e.g., CHO, UeLSi, MDCK, HEK293, and WI38) are available from the American Type Culture 

10 Coflection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing 
of the foreign protein. 

In another enibodunent of theinvoition, natural, modified, or recombinant nucleic acid 
sequences encoding TRICH may be ligated to a heterologous sequence resulting in translation of a 
fusion protdn in any of tiie aforementioned host systems. For exaxaplt, a chimeric TRICH protean 

15 contahiing a heterologous moiety that can be recognized by a coramerdally available antibody may 
facilitate tiie screening of peptide libraries &x inhibitors of TRICH activity. Heterologous protein and 
peptide moieties may also fiidlitate purification of fiision proteins using oHnmercially available affinity 
matrices. Such mdetiesindude, but are not limited to, gtatafliioneS-ti-ansfaase (GST); maltose 
bmding protdn (MBP), thioredoxin (Trx), cahnodulin binding peptide (CBP), 6-His, FLAG, c-myc, and 

20 hemagglutinin (HA). GST. MBP, Trx. CBP, and 6-His enable purification of their cognate fusion 
proteins on immobilized glutatiiione, maltose, phenylaisine oxide, calmodulin, and metal-chdate resins, 
respectivdy. FLAG, c-myc, and hemagglutinin (HA) enable immnnoafBnity purification of fiision 
proteins using commerdally available monodonal and polydQnjQ antibodies tiiat spedfically recognize 
tiiese epitope tags. A fusion protdn may also be engineered to contain a protedyttodeavage site 

25 located between ttie TRICH oicoding sequence and the heterologous protdn sequaice, so tijat TRICH 
may be cleaved away from flie heterologous moiety following purification. Methods for ftisiMi protein 
expression and purification are discussed uiAusubd (1995, su^,ch. 10). Avariety of cammerdally 
available Mts may also be used to facilitate expression and purification of fiision protdns. 

maflmher anbodunent of the invention, synthesis of radiolabeled TRICH may be achieved in 

30 33ti2 using the TNT rabbit reticulocyte lysate or wheat germ extiract system (Promega). These systems 
couple ti^cription and tianslation of protdnecodBng sequences operably associated witii flie 17, T3, or 
SP6 promoters. Translation takes place in tiae presence of a radidabeied ammo add precursor, for 
exaiqple, ^S-methionine. 

TRICH of the present invention or fragments tiiereof may be used to screen for conqjounds 
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that specificany bind to TRICH. At least one and up to a plurality of test compounds may be 
saeened for specific binding to TRICH. Examples of test conqwunds include antibodies, ' 
oligonucleotides, proteins (e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closdy related to the natural ligand of 

5 TRICH, e.g., a Ugand or fragment thereof, a naUiral substrate, a structural or fimctiond mimetic, or a 
natural bindmg partner. (See, e.g., Coligan, J£. et al. (1991) Cnrrent Protocols in Tmmunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which TRICH 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the i 
compound can be rationally designed using known techniques. In one embodiment, screening for 

10 these compounds involves producing appropriate cells which express TRICH, either as a secreted 
protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila , or R 
cdi. Cells e^qpressing TOICH or cell membrane fractions which contain TRICH are then contacted 
with a test compound and binding, stimulation, or inhibition of activity of either TRICH or the 
compound is analyzed. 

15 An assay may simply test binding of a test compound to the polypeptide, wherein binding is 

detected by a fluorophore. radioisotope, enzyme conjugate, or othfer detectable label For example, 
the assay may comprise the steps of combining at least one test compound with TRICH, dther in 
solution or affixed to a solid siqjport. and detecting the binding of TRICH to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 

20 labded competitor. Additionally, the assay may be carried out using cell-free prq>arations, chemical 
libraries, or natural product mixtures, and the test compound(s) may be free in solution or afBxed to a 
solid support 

TRICH of Uie present invention or fragments thereof may be used to screen for conqwunds 
that modulate the activity of TRICH. Such compounds may include agonists, antagonists, or partial 
25 or inverse agonists. In one ranbodiment, an assay is performed under conditions permissive for TRICH 
activity, wherein TRICH is combined with at least one test conqKJund, and the activity of TRICH intiie 
presence of a test compound is compared with tiie activity of TRICH in the absence of tiietest 
compound. A change in the activity of TRICH in the presence of the test conq)Ound is indicative of a 
compound that modulates the activity of TWCH. Alternatively, a test confound is combined witii an 
30 in vitro or cell-free system comprising TRICH under conditions suitable for TRICH activity, and ths 
assay is poformed. In dtha of these assays, a test compound which modulates tiie activity of TRICH 
may do so indirecfly and need not come in direct contact with tiie test compound. At least one and up to 
a plurality of test conqjounds may be saeaied 

In anotiier embodiment, polynucleotides encoding TRICH or their mammalian homologs may 
35 be "knocked out" in an animal modd system using homologous recombination in embryonic stem (ES) 
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cells. Suchtecbniques are weU known in the art and are useM for the g^oi^a 
human disease. (See, e.g., U.S. Pat^ Niimbo' 5,175,383 and U.S. Pateait Nmnbar 5,767,337.) For 
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse emibryo 
and grown in culture. The ES cells are transformed with a veaor containing the g^ of intarest 

5 disrupted by a mark^ gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989) 
Science 244:1288-1292). The vector integrates into the corresponding region of the host geaiome by 
homologous recombination. Alternatively, homologous recombination takes place using llie Cre-loxP 
system to knockout a gene of interest in a tissue- or devdopm^al stage-specific manna: (Marth, J.D. 
(1996) Clin. Invest 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids Res. 25:4323-4330). 

10 Transformed ES cells are identified and miardiiyected into mouse cdl blastocysts such as those from 
the C57BL/6 mouse strain. The blastocysts are surgically transfoxed to pseudqpregnant dams, and the 
resulting chimeiic progmy are genotyped and bred to produce heterozygous or homozygous strains. 
Transg^c atmnals thus gmerated may be tested with potential therapeutic or toxic agents. 

Polynucleotides encodmg TRICH may also be manipulated in vitro in ES cdls doived from 

15 human blastocysts. Human ES ceDs liave the potential to differeitiate into at least dght sq)arate ceJl 
lineages including endodmn, mesoderm, and ectodomal cdl types. These cdl lineages differentiate 
into, for sample, neural cdls, hraiatopoietic lineages, and cardiomyocytes (Thomson, J.A et al. (1998) 
Sdence 282:1145-1147). 

Polynucleotides «icoding TRICH can also be used to create "knockin" humanized animals 

20 (pigs) or transgenic animals (mice or rats) to model human disease. With knockin tedmology, a region 
of a polynucleotide encoding TRICH is injected into animal ES cdls, and the iiyected sequence 
integrates into the animal cdl genome. Transformed cdls are injected into blastulae^ and the blastulae 
are implanted as described above. Transgenic progeny or inbred lines are studied and treated with 
potential pharmaceutical agents to obtain information on treatmeait of a himian disease. Altemativdy, a 

25 mammal inbred to overexpress TRICH, e.g., by secreting TRICH in its milk, may also s^e as a 
convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of TRICH and transporters and ion channels. Therefore, TRICH spears to play a 
30 role in transport* neurological, muscle, and immunological disorders. In tbe treatment of disord^s 
associated with increased TRICH expression or activity, it is desirable to decrease the expression or 
activity of TRICH. In the treatment of disorders assodated with decreased TRICH expresaon or 
activity, it is desirable to increase the expression or activity of TRICH. 

Hierefore, m one embodiment, TRICH or a fragment or dmvative thereof may be 
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administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TRICH. Examples of such disorders include, but are not limited to, a transport disorder 
such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Beclcer's 
muscular dystrophy, Bellas palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes insipidus, 

5 diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, normokalemic 
periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia 
gravis, myotonic dystrophy, catatonia, tardive dyskmesia, dystonias, peripheral neuropathy, cerebral 
neoplasms, prostate cancer, cardiac disorders associated with transport, e,g„ angina, bradyarrythmia, 
tachyarrythmia, hypertension. Long QT syndrome, myocarditis, cardiomyopafliy, nemahne 

10 myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopatiiy, thyrotoxic myopathy, 
ethanol myopathy, dennatomyositis, inclusion body myositis, infectious myositis, polymyositis, 
neurological disorders associated with transport, e,g., Alzheimer's disease, amnesia, bipolar disorder, 
dementia, depression, epilepsy, Tourette's disord^, paranoid psychoses, and schizophrenia, and othsc 
disorders associated witii transport, e.g., neurofibromatosis, postherpetic neuralgia, trigemmal 

15 neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery 
stenosis, sensorineural autosomal deafiiess, hyperglycemia, hypoglycmia. Grave's disease, goit^, 
Cushing's disease, Addison's disease, glucose-galactose malabsorption syndrome, 
hypCTcholestearolemia, adrenoleukodystrophy, Zellwega: syndrome. Menkes disease, occipital horn 
syndrome, von Gierke disease, cystinuria, iminogjydnuria, Hartup disease, and Fanconi disease; a 

20 neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms. 
Alzheimer's disease. Pick's disease, Huntington's disease, dementia, Parkmson's disease and other 
extrapyramidal disorders, amyotroifliic lateral sclerosis and othea* motor neuron disorders, progressive 
neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other 
demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural enq)yema, qpidural 

25 abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous 
system disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann- 
Straussler-Schdnker syndrome, fatjd familial insonmia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebeUoretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 

30 nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 

nervous system disorders, cranial n^ve disorders, spinal cord diseases, muscular dystrophy and oibsr 
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis, 
inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental 
disord^s including mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), 

35 akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, 
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postherpetic neuralgia, Tourette's disonto, progressive supranudear palsy, corticobasal degeneration, 
and familial ftomtoteanporal dementia; a mnscle disorder sudi as cardiomyqpafliy, myocarditis, 
Duchenne's muscular dystrophy, Becker's muscular dystrpphy, myotonic dystrophy, central cote 
disease, nemaline myopathy, cenlronuclear myopathy, lipid myopathy, mitpchondrial myopathy, 

5 infectious myositis, polymyositis, dennatomyositis, inclusion body myositis, ttiyrotoxic myopaBiy, 
ethanol myopathy, angina, anaphylactic shock, arrhythmias, asthma, cardiovascular shock, Cushing's 
syndrome, hypertension, hypoglycemia, myocardial mfarction, migraine, pheodnromocytoma, and 
myopathies includmg encephalopathy, epilepsy, Keams-Sayfe syndrome, lactic acidosis, myoclonic 
disorder, ophthalmoplegia, and acid maltase deficiency (AMD, also known as Pompe's disease); and 

10 an immunological disordo: such as acquired immunodeficiency syndrome (AIDS), Addison' s disease, 
adult respiratory distress syndrome, aUergies, ankylosing spondylitis, amyloidosis, anemia, asflnna. 
aflierosdCTOsis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyraidocrinopathy-candidiasis-ectodennal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Oohn's disease, atopic dermatitis, dennatomyositis, diabetes meOitus, emphysona, 

15 episodic lynq)hopenia with lymphocytotoxms, eryfliroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 
thyrcAditis, hypereosinophffla, irritable bowd syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, 
psoriasis, Reiter*s syndrome, riieumatoid arthritis, scleroderma. Sjogren's syndrome, systemic 

20 anaphylaxis, systemic lupus eryfliematosus, systanic sdarosis, thrombocytopenic purpura, ulcerative 
colitis, uveitis, Wexoei syndrome, complications of canca, hemodialysis, and extracorporeal 
circulation, viral, bacterijd, fungal, parasitic, protozoal, and hdminthic mfections, and trauma. 

In anoflier embodhnent, a vector capable of expressing TRICH or a fragment or daivative 
tiiereof may be admniistered to a subject to treat or prevent a disorder assodated with decreased 

25 expression or activity of HaCH induding, but not limited to, those described above. 

Ill a further embodimait, a conq)OSition conpisuig a substantially purified TRICH m 
coiyunctton witii a suitable pharmaceutical carrier may be admmisteredto a subject to treat (x prevoit a 
disorder associated with decreased expression or activity of TRICH induding, but not Ihnited to, tiiose 
provided above. 

30 In still another embodhnent, an agonist whidi modulates the activity of TRICH may be 

administered to a subject to treat or prevent a disorder assodated with decreased expression or activity 
of TRICH mcludhig, but not limited to, those listed above. 

In a further embodiment, an antagonist of TRICH may be administered to a subject to ti-eat or 
prevent a disorder assodated with inaeased expression or activity of TRICH. Examples of sudi 
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disorders include, but are not limited to. those transport, neurological, nmsde. and inmninological 
disorders described abov^ m one aspect, an antibody wWcb spedfically binds TRICH may be used 
directly as an antagonist or indirectly as a targeting or delivery mectoanismJor bringing a 
pharmaceutical agaat to cdls or tissues which express TRICH. 
5 in an additional embodiment, a vector expressing the conq)lement of the polynucleotide 

encoding TRICH may be administered to a subject to treat or prevent a disorder associated with ; 
u^reased ejqiression or activity of TRICH including, but not limited to, those desafted above. ] 

toother mbodimeats. any of flie proteins, antagonists, antibodies, agonists, complem^ 
sequences, or vectors of the invention may be adndnistered in combination with other appropriate 
10 tiierapeutic agents. Sdection of tiie appropriate agents for use m combination tiierapy may be made by 
oneofordinaryskintoflieart.accordingtocoiiventiQnalphannaceuticalprind The combination 
of tiierapeuttc agents may act synergisticany to effect tiieti:eatment or prevention of the various 
disorders described above. Using tins s?)proach, one may be able to achieve thfxapeutic efficacy witii 
lower dosages of each agent, tims reducing ttie potential for adverse side effects. 
15 An antagonist of TRICH may be FOduced using mettjods whidi are generally known hitiie art 

In particular, purified TRICH may be used to produce antiiwdies or to screen libraries of 
pharmaceutical agentstoidendfytiioseA^^ichspedficanybindTRICH. Antibodies to TRICH may 
also be generated using methods tiiat are wdl Imown in tiie art Such antibodies may indude, but are 
not limited to, polyclonal, monoclonal, chimeric, and single diain antibodies. Fab fragments, and 
20 fragmentsproducedbyaFabexpressionhUrary. Neutraliztag antibodies (i.e..ttiosewhidi inhibit 
dimer formation) are geaia-ally prefored for tiierapeutic use. 

For tiie production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and otiiers may be immunized by injection witii TRICH or witii any fragment or oligc?)q)ti<te thereof 
whichhas immunogenic properties. Dq)ending on tiie host species, various adjuvants may be used to 
25 taaeasehnmmiologicalresponse. Such adjuvants include, but are not limited to, Freund's.minaral gels 
sudi as aluminumhydroxide, and surface active substances sudi as lysoledtiiin, pluronic polyols, 
pdyanions. pqrtides. ofl emulsions. KLH. and dinitrophenoL Among adjuvants used in humans, BCG 
(bacim Calmette-Guerin) and rnrYnfthacterinmparvum are espedally preferable. 

It is preferred tiiat tiie digopq)tides, pqJtides. or fragments used to induce antibodies to 
30 TRICHhaveanaminoaddsequeaicecQnsistingofatleastab6ut5amtoDadds.andgaiCT^^ 
consist of at least about 10 amhio adds. It is also preferable tiiattiieseoligppq)tides, pqitides, or 
fragments areidenticaltoaporttonoftiieaminoaddsequenceofttienaturalprotdn. Short stretches of 
TRICH ammo adds may be fused witti tiiose of anoflier protdn, sudi as KLH. and antibodies to tiie 
chhneric molecule may be produced. 
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Monoclonal antibodies to TRICH may be prepared using any technique wMdi provides for the 
production of antibody molecules by continuous cell lines in culture. These include* but are not Innited 
to, the hybridoma technique, the human B-cell hybridoma technique, and the EB V-hybridoma 
technique. (See. e.g., KoMer, G. et al. (1975) Nature 256:495497; Kozbor, D. et al. (1985) J. 

5 Immunol. Methods 81:31-42; Cote, RJ. et al. (1983) Proc. Nafl. Acad ScL USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. CeQ Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with apprq)riate 
antigen specificity and biological activity, can be used (See, e.g., Morrison, S,L. et al. (1984) Proc. 

10 Natl. Acad Sd, USA 81:6851-6855; Neub^ger, M.S, et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for tiie production of single 
chain antibodies may be adapted using methods known in the art, to produce TRICH-specific single 
chain antibodies. Antibodies witii rdated specificity, but of distinct idiotypic composition, may be 
gena-ated by chain $h|iffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton, 

15 D.R. (1991) Proc. Nati. Acad Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing mvivo production in the lymphocyte population 
Q£ by screening imraunoglobiilin Hbraries or panels of highly specific binding reagents as disclosed in 
tiie literature. (See, e.g., Qrlandi, R. et al. (1989) Proc. Nati. Acad Sd. USA 86:3833-3837; Winter, 
G. et aL (1991) Nature 349:293-299.) 

20 Antibody fragmoits which contain specific binduig sites for TRICH may also be generated ' 

For example, such fragments mclude, but are not limited to, F(ab02 fragments produced by pepsm 
digestionof the antibody molecule and Fab fragmmts generated by reducii^ the disulfide bridges of the 
F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow raqpid and easy 
identification of monoclonal Fab fragmeaats with the desired specificity. (See, e.g., Huse, W.D. et al. 

25 (1989) SdCTCe 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using eitha: 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Sudi 
inununoassays typically involve the measurement of complex formation between TRICH and its 

30 specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive 
to two non-interfering TRICH epitopes is generally used but a competitive binding assay may also be 
CToployed (Pound supra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 
may be used to assess the affinity of antibodies for TRICH. Affinity is expressed as an association 
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constant, K^, wWdi is defined as Uie molar concentration of TRICH-antibody conaplex divided by tbe 
molar concentrations of free antigen and free antibody nnder equilibrium conditions. The detennined 
for a preparation of polyclonal antibodies, which are heterogmeous in Ihdr affinities for multiple 
TRICH epitopes, represents the average affinity, or avidity, of the antibodies for TRICH. The Ka 

5 detennined for a preparation of monoclonal antibodies, which are monospecific for a particular TRICH 
epitope, represents a true measure of affinity, ffigh-affinity antibody preparations with ranging from 
about 10' to 10^^ L/mole are preferred for use in immunoassays in wMch the TRICH-antibody complex 
must withstand rigorous manipulations. Low-affinity antibody preparations with ranging from 
about 10^ to 10^ L/mole are preferred for use in immunopurification and similar procedures which 

10 ultimately require dissociation of TRICH, prrferably in active form, from the antibody (Catty, D. 
(1988) Antibodies. Volume I: A Practical Approach , IRL Press, Washington DC; Lidddl, iJE. and A. 
Cryear (1991) A Practical Guide to Monoclonal Antibodies, John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be ftirther evaluated to determine 
the quality and suitability of such prq)arations foe certain downstream applications. For example, a 

15 polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of TRICH-antibody 
coin)l«ces. Procedures for evaluating antibody specificity, titar, and avidity, and guiddines for 
antibody quality and usage in various applications, are gaia-ally available. (See, e.g.. Catty, supra, and 
Coligan et al. supra .^ 

20 In another ^bodhn^t of the invention, the polynucleotides encoding TRICH, or any firagment 

or complemrat thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
^ression can be achieved by designing conoplem^itary sequences or anlisaise molecules (DNA, RNA, 
PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding TRICH. 
Such technology is wdl known in the art, and antis^e ohgonucleotides or larger firagments can be 

25 designed firom various locations along the coding or control regions of sequences encodfaig TRICH. 
(See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics , Humana Press Inc., TotawaNL) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 

30 complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g.. 
Slater, J.E. et al. (1998) J. Allergy Ch. hnmunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno-associated vhus vectors. (See, e.g.. Miller, A.D. (1990) Blood 
76:271; Ausubel, supra : Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
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gene deUvery mechanisms Include Uposome-derived systems, artifidal viral envelopes, and oflier 
systems known in the art (See. e.g.. Rossi. JJ. (1995) Br. Med. Bnll. 51(l):217-225; Boado. RJ. et 
al. (1998) J. Pharm. Sd. 87(1 1):1308-1315; and Moms. M.C. et sfl. (1997) Nuddc Adds Res. 
25(14):2730-2736.) 

5 In another embodiment of the invention, polynucleotides encodhig TRICH maybeusedfor 

somatic or germUne gene therapy. Gene therapy may be performed to (i) coirect a gaietic defidency 
(e.g.. in the cases of severe combined immunodeadency (SCID)-X1 disease charaderized by X-linked 
inheritance (Cavazzana-Calvo, M. et al. (2000) Sdence 288:669-672), severe combined 
iniraunodefidency syndrome associated witb an inherited adenosine deaminase (ADA) deficiency 

10 (Blaese. R.M. et aL (1995) Sdence 270:475-480; Bordignon. C. et al. (1995) Sdence 270:470-475). 
cystic fibrosis (Zabner. J. et al. (1993) Cdl 75:207-216; Crystal. R.G. et al. (1995) Hum Gene 
'niersq)y 6:643-666; Crystal. R.G. et aL (1995) Hum. Gene lUerapy 6:667-703). thalassamias, familial 
hyperdiolesterolemia, andhanqphilia resuWng fromFadior vm or Factor IX deficiencies (Crystal. 
R.G. (1995) Sdence 270:404-410; Venna. I.M. and N. Somia (1997) Nature 389:239-242)). (ii) 

15 express a conditionally lethal geneprodud (e.g.. in the case of cancers whidiresult frommiregulated 
cdl proliferation), or (ii!) express a protdnwhidi affords protection against intracdlular parasites (e.g.. 
. against human retroviruses, sudi as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; Poesdila. E. et aL (1996) Proc. NatL Acad. Sd. USA. 93:11395-1 1399). 
hepatitis B or C virus (HBV. HCV); fungal parasites, sudi as Candida albicans and ParacocddiQides 

20 brasiliensis ; and protozoan parasites sudi as Piasmnninm faldoarum and TrYpanQ?oma cruzi) . In the 
case where a genetic defidency in TRICH expression or regulation causes disease, Ihe expression of 
TRICH from an appropriate population of transduced ceffls may alleviate the dinical maMestatians 
caused by the genetic defidency. 

In a further embodiment of the invention, diseases or disorders caused by defidendes in 

25 TRICH are treated by constructing mammalian expression vectors encoding TRICH and introducing 
lliese vectors by medianical means into TRICH-defident cdls. Medianical transfer tedmologies for 
use witti cdls in vivo or ex vitro include (i) dired DNA microinjedion into individual cdls. (ii) ballistic 
gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) 
tiieuseof DNAtransposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. Biodiem. 62:191-. 

30 217; Mcs. Z. (1997) Cdl 91:501-510; Boulay. J-L. and H. R6dpon (1998) Curr. Opin. BiotedmciL 
9:445-450). 

Expression vedors that may be effective for the expression of TRICH include, but are not 
limited to. the PCDNA 3.1, EPITAG, PRCCMV2. PREP. PVAX vedors (Invitrogea, Carlsbad CA). 
PCMV-SCRIPT. PCMV-TAG, PEGSH/PERV (Stratagene. La Jblla CA). and PTET-OFF. 
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PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Qontech, Palo Alto CA). TRICH may be eaqjressed 
using (i) a constitatiYdy active proniDter. (e.g., from cytomegalovirus (CMV), Rous sarcoma virui 
(RSV), SV40 virus, thymidine kinase (TK), or p-actin genes), (ii) an indudble promoter (e.g.. the 
tetracycline-regulated promota (Gossen. M. and H. Bnjard (1992) Proc. NatL Acad Set U.S.A. 
5 89:5547-5551; Gossen. M. et al. (1995) Sdaice 268:1766-1769; Rossi. F.M.V. andH.M. Blau (1998) 
Curr. Opin. Biotedmol. 9:451-456). commercially available in the T-REXplasmid (Invitrogen)); the 
ecdyson^-induclble promoter (available in the plasmids PVGRXR and FIND; Invitrogen); the | 
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi. F.M.V. 
and Blau. H.M. supra) ), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
10 gene aicodutg TRICH from a normal individual. 

Commerciany available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECnON KIT, available from Invitrogoi) allow one with ordinary skill m the art to deJiver 
polynucleotides to target cdls in culture and require minimal effort to optunize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
15 (Graham. Fi. and A.J. Eb (1973) Virdogy 52:456-467), or by dectrqporation (Neumann, E. et al. 
(1982) EMBO J. 1:841-845). The introduction of DNA to prhnary ce31s requires modification of tijese 
staiyiqrrtiyf^ TnaTQinjilian transfection protOCOls. 

In anoflier embodiment of the invention, dfeeases or disorders caused by genetic defects witii 
respect to TRICH expression are treated by constracting a reteovirus vector consisting of (i) the 
20 polynucleotide encodmg TRICH under tiie control of an hidq)endeat promoter or the • 
tetmmal repeat (LTR) promoter, (ii) ^opriate RNApackaging signals, and (iii) a Rev-responsive 
element (RRE) along witti additional retrovirus cw-acting RNA sequences and codhig sequmces 
required for efficient vector propagation. Retirovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
25 Nati. Acad. Sd. U.S.A 92:6733-6737), incorporated by rdferaicehereia Ihe vector is propagated hi 
an appropriate vector producing cell Une (VPCL) that expresses an envdope gene witii a tiropism for 
recq)tors on tiie target cells or a promiscuous envelope protdn such as VSVg (Armentano, D, et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A et al. (1987) J. Virol. 61:1639-1646; Adam, M.A and 
AD. Miller (1988) J. Virol. 62:3802-3806; Dull, T et al. (1998) J. Virol. 72:8463-8471; Zufferey. R 
30 et al. (1998) J. ViroL 72:9873-9880). U.S. Patent Number 5.910.434 to Rigg ("Method for obtatoing 
retrovirus packaging cdl Imes produdng high tiransducing effidency retroviral supernatant") discloses a 
mefliod for obtaining retrovirus padcaging cdl Imes and is herd)y incQq)oratedby reference. 
Propagation of retrovirus vectors, transduction of a population of ceills (e.g., CD4* T-cells). and tiie 
return of transduced cdls to a patient are procedures wdl known to persons skilled ui tiie art of gene 
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therapy and have beea wdl docaunmted (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et 
ai (1997) Blood 89:2259-2267; Bonyhadi, MJL. (1997) J. ViroL 71:4707-4716; Ranga. U. et al. 
(1998) Proc. Natl, Acad. Sd. U.S.A 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

in tbe alternative, an admovirus-based gme ihexspy delivery system is used to deliver 
5 polynudeotides encoding TRICH to cells which have one or more gmetic abnOTmalities with respect to 
the expression of TRICH. The construction and packaging of adenovirus-based vectors are well known 
to those with ordinary skill in the art. Rq)lication defective adenovirus vectors have proven to be 
versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 

10 described m U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by refer^ce. For adenoviral vectors, see also Antinozzi, P.A et al. (1999) Anna. 
Rev. Nutr. 19:511^544 and Verma. LM. and N. Somia (1997) Nature 18:389:239-242, both 
incorporated by reference herein. 

In another alternative, a hopes-based, g«ie thorapy delivery syst^ is used to deJiver 

15 polynucleotides encodmg TRICH to target cdls which have one or more genetic abnormialities with 
respect to the expression of TRICH. The use of h^es simpler virus (HS V)-based vectors may be 
especially valuable fcM: introducmg TRICH to cdls of the caitral narvous system, for whidi HSV has a 
trq>isnL The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art. A rqplication-competent herpes simplex virus (HSV) type 1-based vector has 

20 been used to ddiva a reporto: geme to the eyes of primates (Liu, X, al. (1999) Bxp. Eye 

Res.l69:385-395). The conshiiction of a HSV-1 \drus vector has also been disclosed in detail in U.S. 
Patait Nmhbo: 5,804,413 to DeLuca ("Hopes sin5)lex virus strains for gene transfer"), which is 
hereby incorporated by referoice. U.S. Patent Number 5,804,413 teaches the use of recombinant HSV 
d92 which consists of a genome containing at least one exogenous g&ne to be transferred to a cell under 

25 the control of the appropriate promote: for purposes including human gene therapy. Also taught by this 
patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. 
Fcm: HSV vectors, see also Goms, W.F. et al. (1999) J. Virol 73:519-532 andXu, H. et al. (1994) Dev. 
Biol, 163:152-161, hereby incorporated by reference. The manipulation of cloned herpesvirus 
sequences, the generation of recombinant virus following the transfection of multiple plasmids 

30 containing different segments of the large herpesvirus genomes, the growth and propagation of 

herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of ordinary 
skill in the art. 

In another alternative* an alphavirus (positive, single-stranded RN A virus) vector is used to 
deliver polynucleotides encoding TRICH to target cdls. The biology of ttie prototypic alphavirus. 



51 



wo 01/46258 



PCT/USOO/35095 



SanUki Forest Virus (SFV), has been studied exteaosively and gene transfer vectors have beem based m 
the SFV genome (Garoff, H. and K.-J. li (1998) Cuir. Opin. Biotedmol. 9:464-469). During ' 
alphavirus RNArepUcation, a subgenomic RNAis generated that ncirmally encodes the viral capsid 
proteins. This subgenomic RNArqplicates to higher levds than the ftfll length genomic RNA, resulting 
5 in the oveiproduction of c^sid proteins relative to the viral proteins with enzymatic activity (e.g., 
protease and polymerase). Simnarly. inserting the coding sequence for TRICH into the alphavirus 
g^ome in place of the capsid-coding region results in the production of a large number of TRICH- 
coding RNAs and the synthesis of high levds of TRICH in vector transduced ceils. While alphavirus 
infection is typically associated with ceU lysis within a few days, the abiUty to establish a persistent 
10 infection in hamster normal kidney cdls (BHK-21) with a variant of Sindbis virus (SIN) indicates that 
the lytic replication of aJphaviruses can be altered to suit the needs of the gene therapy application 
(Dryga, S.A. et al, (1997) Virology 228:74-83). The wide host range of alphaviruses win allow the 
introduction of TRICH into a variety of ceil types. The specific transduction of a subset of ceils in a 
population may require the sorting of ceils prior to transduction. The mdhods of manipulating 
15 mfectious cDNA dones of alphaviruses, performmg alphavirus cDNA and RNA t:ansfections, and 
performing alphavirus infections, are weil known to tiiosewitti ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 
and +10 frran the start site, may also be assptoyed to inhibit gaoe ejqwession. Simflarly, inhibition can 
be achieved using triple hdix base-pairing methodology. Tr^le hdix pairing is useful because it causes 
20 hiMbition of the ability of the double heJix to open suffidenlly for the binding of polymerases , 

transcription factors, or regulatory molecules. Recent ther^tic advances using triplet DNA have 
been described in tiie literature. (See, e.g.. Gee, JJE. et aL (1994) in Huber, BB. and B.I. Can, 
Molecular and Trnmunologic Annroadies. Futura Publishing, Mt Kisco NY, pp. 163-177.) A 
complementary sequence or antisense mdecule may also be designed to block translation of mRNAby 
25 preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the spedfic deavage of 
RNA The mechanism of ribozyme action involves sequcaice-specific hybridization of tiie ribozyme 
molecule to complementary target RNA followed by endonucleolytic cleavage. For example, 
oigineered hammeihead motif ribozyme molecules may specifically and effidaitiy catalyze 
30 eadatiucleolytic cleavage of sequences encoding TRICH. 

Specific ribozyme cleavage sites wittiin any potaitial RNA target are initially idaitified by 
scanning ti» target molecule for ribozyme deavage sites, Induding the following sequences: GUA 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
conresponding to tiie region of the target gaie containing ti» deavage site, may be evaluated far 
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secondary structural features wbidi may reiMkxIlieoIlgCMMde^^ The suitability of 

candidate targets may also be evaluated by testing accessiWty to hyhridizatlQQwimcomplen^ 
oligonucleotides using ribonuclease protection assays. 

Coraplem^y ribonucldc add molecules andribozymes of the invention may be prepared by 

5 any method known in the art for the synthesis of nucldc acid molecules. These include techniques for 
diemically synthesizing oUgonucleotides sudi as solid phase phosphoramidite diemical synthesis. 
Altemativdy, RNA molecules my be generated by in^dfro and in^ovo transcription of DNAse(pen^ 
encodmg TRICR Sudi DNA sequences may be incorporated into a wide variety of vectors with 
suitable RNA polymerase promoters sudi as T7 or SP6. Alternatively, these cDNA construds that 

10 synthesize con5)lementary RNA, constitutivdy or indudbly. can be introduced into cdl lines, cdls, or 
tissues. 

RNA molecules may be modified to increase iutracdlular stability and balf-life. Possible 
modifications indude, but are not limited to, the addition of nanktog sequences atthe5'and/or 3'ends 
of the molecule, ortiiense of pJiosphorothioate or T 0-methyl rather than phosphodiesterase linkages 
15 wilWnlhebadcbone of tiie molecule. This concqrt is inherent in the production of PNAs and can be 
extended in all of fliesemdecules by tiiehichisionof nontradWonalbases sudi as hiosine. queoshie, and 
wybutosine, as wdl as acetyl-, methyl-. Ihio-, and simflarly modified fonns of adenine, cytidme. 
guamne, thymine, and uridine wbidi are not as easfly recognized by endogenous emJonucleases. 
An additional embodiment of tiie mvention encon5)asses a method far screening for a 
20 compound whidi is effective hi alterhig expression of a polynucleotide encoding TRICH. 

Conqwunds whidi may be effective in altering esqiression of a spedfic polynudeotide may hudude. 
but are not Ihnited to, oligonucleotides, antisense oligonucleotides, triple hdix-forming 
oligonucleotides, transcription factors and other polypeptide transcrqrtional regulators, and non- 
maaomoleculai di^nical Mitities whidi are capable of mterading with spedfic polynndeotide 
25 sequences. Effedive compounds may Jdter polynucleotide ejqjression by ading as d1heruihfl)i^ 
promoters of polynucleotide eqjtession. Thus, hi the treatment of disorders assodated with hicreased 
TRICH expession or activity, a compound which spedficaUy inhibits expression of tiie 
polynucleotide encodhig TRICH may be tiierapaitically useful, and m the treament of disorders 
assodated with decreased TRICH e:5iression or adivity, a compound whidi spedfically promotes 
30 expression of the polynucleotide encodmg TRICH may be ther^)eu1icany usefuL 

At least one, and up to a plurality, of test compounds may be screened for effectiveness hi 
altering expression of a specific polynucleotide. A test compound may be obtamed by any method 
commonly known in tiie art, includmg chemical modification of a compound known to be effective m 
altermg polynucleotide expression; selection ftom an existing, commercially-available or proprietary 
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library of naturany-occuning or non-natuna chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleottde; and selection fiom i 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding TRICH is exposed to at least one test compound thus obtained. THe sample 

5 may comprise, for example, an intact or permeabilized cdl, or an inAgtafi cdl-free or reconstituted 
biochemical system. Alterations in the expression of a polynudeotide encoding TRICH are assayed 
by any method commonly known in the art Typically, the expression of a specific nucleotide is 
detected by hybridization wifli a probe having a nucleotide sequence complementary to ttie sequence 
of the polynucleotide encoding TRICH. The amount of hybridization may be quantified, thus 

10 forming tiie basis for a comparison of the expression of the polynucleotide botii with and without 
exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide 
exposed to a test compound indicates tiiat ttie test compound is effective in altering tiie expression of 
tiie polynucleotide. A saeen for a compound effective m altering expression of a specific 
polynucleotide can be carried out, for example, using a Schizosaccharomyces pombe gene expression 

15 system (Atkins. D. et al. (1999) U.S. Patent No. 5,932,435; Amdt, G.M. et al. (2000) Nucleic Acids 
Res. 28:E15) or a human cell line such as HeLa cell (Clarke. Mi. et al. (2000) Biochem. Biophys. 
Res. Commun. 268:8-13). A particular embodiment of tiie present invention involves screening a 
combinatorial Ubrary of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nudeic adds, and modified oligonudeotides) for antisense activity against a specific polynucleotide 

20 sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. 
Patent No. 6,022,691). 

Many mettiods for introducing vertcffs into cells or tissues are available and equally suitable for 
use in vivo, in vitro, and ex vivo . For ex vivo tiierapy. vectors may be inOroduced into stem cdls taken 
from the patient and donafly propagated for autologous transplant back rato tiiat same patiait. 
25 Ddivery by transfection. by liposome mjections, or by polycationic amino polymers may be achieved 
using mefliods whidi are wdl knovwi in tiie art. (See, e.g., Goldman. C.K. et aL (1997) Nat 
Biotedmol. 15:462-466.) 

Any of the therapeutic metbods described above may be applied to any subject in need of suCh 
flier^y, indnding, fOT exan?)le, mammals sudi as humans, dogs, cats, cows, horses, rabbits, and 
30 monkeys. 

An additional aiibodtaient of tiie mvention rdates to tiie administiration of a composition which 
generany coraptises an active ingredient formulated witti a pharmaceutically accqjtable exdpient. 
Ex(^)ients may uidude, for ecample, sugars, starches, celluloses, gums, and proteins. Various 
formulations are commonly known and are tiioroughly discussed in tiie latest edition of RaninRton's 
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Ph.rn.....tir.al Sdeaces (Maack Publishing. EastonPA). Such compositions may consist of TRICH, 
antibodies to TRICH, and mimetics, agonists, antagonists, or inMbitors of TRICH. 

The compositions utilized in this invention may be administered by any number of roo^ 

inctading. but not limited to, oral, intravenous, intramuscular, intra-art^al, intramedullary, intrathecal, 
5 iniravenlriailar. pulmonary, transdermal, subcutaneous, intraperiton^^^ 

sublingual, or rectal means. 

Can^KJSitions for pulmonary administrationmay be prepared in Uquid or dry powder form. 

TUfise compositions are generafly aerosolized immediatdy prior to inhalation by the patiait. Intbe case 
of small mcdecules (e.g. traditional low molecular vvdght organic drugs), aerosol ddivery of fast-acting 
10 formulations is wdl-knowninthe art m the case of maaomolecules (^g. larger peptides and protdns). 
rec«rt devdopments inthefidd of puhnonary ddivery via the alveolar re^on of thelung have enabled 
thepraclical ddivery of drugs such as insulin to blood drculation (see. e.g„ Patton. J.S. et aL, U.S. 
PatentNo. 5,997,848), Pdnwnary ddivery has the advantage of administration without needle 
uijection. and obviates the need for potentially toxic penetration enhances. 
15 Compositions suitable for use in the invention include con?)ositians wherdnthe active 

■ ingreaentsarecontahiedinan^fectiveamounttoadnevethehtfende^ Thedetmninationof 
an effective dose is well within the capability of those skffled in the art 

Specialized forms of compositions may be prq)ared for direct intracdlular ddivery of 
maaomolecules comprising TRICH or fragments thereof. FQr acample, liposome prqjarations 
20 containing a ceB-impermeable macromdlecule may promote cdl fusion and intracdlular ddivery of the 
macromolecule. Altemativdy. TRICH or a fragment thereof may be joined to a short cationic N- 
terminalportionfromthBHIVTat-lprotdn. Fusion protdns thus generated have been found to 
transduce into the cells of all tissues, induding the brain, in a mouse modd system (Sdiwarze, S.R. et 
al. (1999) Sdence 285:1569-1572). 
25 For any compound, the therapeutically effective dose can be estimated initially diher hi cdl 

culture assays. e.g.. of neoplastic cdls. or in animal models such as mice, rats, rabbite. dogs, monkeys, 
orpigs. Ananimalmoddmayalsobeusedtodetermhietheappropriateconcenfrationrangeandroute 
Qfadmtaistration. Sudimformation can then be used to determine useful doses and routes for 
administration in humans. 

30 Athfir^ticanyeffectivedosereferstolhatamountQfactiveingredientforexampleTO^ 
or fragments thereof, antibodies of TRICH. and agonists, antagonists or hjhibilDrs of TRICH. whidi 
amdiorates the sympttans or condition. Therapeutic efBcacy and toxicity may be determhied by 
standard pharmaceutical procedures in cdl cultures or with experhnental animals, sudi as by 
calculating IheEDso (the dose therapeutically eHectivehi50% of the population) or LD50 (the dose 
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lethal to 50% of llie population) statistics . The dose ratio of toxic to tharapeutic eCfects is the 
therq)aitic index, wMchcanbe expressed as the LDso^ CompQsitiQnsvvhidi exhibit large 

therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such compositions is 

5 pref^ably within a range of circulating concentrations that includes the ED50 withlitfle or no toxicity. 
The dosage varies within this range depending upon the dosage form employed, the s^itivity of the 
patient, and the route of administratioa I 

The exact dosage will be detmnined by the practitioner, in light of factors related to the!subject 
requiring treatment Dosage and administration are adjusted to provide sufficient levels of the active 

10 mcdety or to maintain the desired effect. Factors which may be taken into account include the severity 
of the disease state* the gaieral health of the subject, the age, weight, and gender of the subject, time 
and frequmq? of administration, drug combmatiQn(s), reaction sensitivities, and response to thaapy. 
Long-acting compositions may be adnunistered every 3 to 4 days, every week, or biweekly depeaiding 
on the half-life and clearance rate of the particular formulation. 

15 Normal dosage amounts may vary from about 0. 1 Mg to 100,000 /^g, up to a total dose of 

about 1 gram, depending upon the route of admmistration. Guidance as to particular dosages and 
methods of ddivery is provided in tiie litaature and gaaarally available to practitioners in the art. 
Those sMned in the art win employ different formulations for nucleotides than for protdns or thdr 
inhibitors. Smularly, ddivery of polynucleotides or polypeptides will be specific to particular ceils, 

20 conditions, locations, etc. 
DIAGNOSTICS 

In another eamibodunent, antibodies \^*idi spedficaUy bind TRIG H may be used for the 
diagnosis of disorders characterized by expression of TRIGH, or in assays to monita patients bdng 
treated witii TRICH or agonists, antagonists, or inhibitors of TRICH. Antibodies useful for diagnostic 

25 purposes may be prepared in the same manner as described above for ttiarapeutics. Diagnostic assays 
for TRICH include methods which utilize the antibody and a label to detect TRIGH in human body 
fluids or in extracts of cells or tissues. The antibodies may be used witii or wittiout modification, and 
may be labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of 
reporter molecules, several of which are described above, are known in the art and may be used. 

30 A variety of protocols for measuring TRICH, including ELIS As, RIAs, and FAGS, are known 

in the art and provide a basis for diagnosing altered car abnormal levels of TRICH expression. Normal 
or standard values for TRIGH expression are established by combining body fluids or cell extiracts 
taken from normal mammalian subjects, for exan^le, human subjects, wflh antibodies to TRICH under 
conditions suitable for complex f ormatioa The amount of standard complex fOTuation may be 
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quantitated by various metbods. sucb as photometric means. Quantities of TRICH repressed in 
subject, control, and disease samples from biopsied tissues are compared vdtti tbe standard values. 
Deviatiaubetweea standard and subject values establishes theparameters for diagnosing disease. 

m another embodiment of the invention, the polynucleotides encoding TRICH may beusedfor 

5 diagnostic purposes. The polynucleotides which may be used indude oligonucleotide sequaices, 
can?)lemfiDtaryRNAandDNAmolecules. andPNAs. The polynucleotides may be used to detect and 
quantify gene expression in biopsied tissues in which expression of TRICH may be correlated with 
disease. TTiediagnostic assay may beusedto determine absence, preseuce, and ejccess expression of 
TRICH, andto monitor regulation of TRICH levels during tiierapeatic intervention. 

10 mone aspect, hybridization with PCR probes which arecapable of detecting polynucleotide 

sequences, including genomic sequences, encoding TRICH or closdy related molecules may be used to 
identifynudeicacidsequenceswhichencodeTRICH. Thespedfidtyoftheprobe. whetiieritis made 
from a highly spedfic region. e.g.. the 5' r^tory region, or from a less spedfic region, e.g., a 
conserved motif , and the stringency of the hybridization or anqflification win determine whether tiie 

15 probeidaitifiesoiiynaturallyoccurringsequaK«sencodingTRICH.dldicvari^^ 

sequences. 

Prdbesmayalsobeusedforthedetectionofrdatedsequences.andmayhaveatleast50% 

sequenceidentitytoanyoftiieTRICHencodingsequences. The hybridization probes of the subject 
invention maybe DNA or RNA and may bederivedfromthe sequence of SEQ ID NO:28-54 or from 

20 genomic sequences including promoters. «jhancers, and hilrons of tiie TRICH gene. 

Means for producmg spedfic hybridization probes fijr DNAs encoding TRICH indudeihe 
cloning of polynucleotide sequences encoding TRICH or TRICH derivatives into vectors for the 
production of mRNAprobes. Sudi vectors are known hi the art, are commerdally available, and may 
be used to synttiesize RNA probes in vitro by means of the addition of the appropriate RNA 

25 polymerases andtiieappropriatelabdednucleotides. Hybridization probes may be labded by a variety 
of reporter groups, for exanqjle. by radionuclides sudi as ^ or «S, or by enzymatic labds, sudias 
alkaline phosphatase coupled to the probe via avidinA)iotincouplmg systems, andtiielike. 

Polynucleotide sequences encodmg TRICH may be used for the diagnosis of disorders 
assodated with expression of TRICH. Examples of such disorders include, but arenotlhnitedt^^ 

30 fransport disorder sudi as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis. 
Bedcer-s muscular dystrophy. Bell's palsy. Charcot-Marie Tootii disease, diabetes memtus. diabetes 
insipidus, diabetic neuropathy, Duchemie muscular dystrophy, hyperkalemic periodic paralysis, 
noimokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, 
myaslhaua gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral 
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neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with transport. e.g.. 
angina, bradyairythmia, tachyairythmia, hypertension. Long QT syndrome, myocarditis, 
cardiomyopathy, nemaline myopathy, centronnclear myopathy, lipid myppafliy, mitochondrial 
myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body myositis, 
5 infectious myositis, polymyositis, neurological disorders associated with transport, e.g.. Alzhdmer's 
disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, paranoid 
psychoses, and schizophrenia, and other disorders associated with transport, e.g.. neurofibromatosis, 
postherpetic neuralgia, trigenunal neuropatiiy, sarcoidosis, sickle cell anemia, Wilson's disease, 
cataracts, infertiUty. puhnonary artery stenosis, sensorineural autosomal dea&ess, hyperglycemia, 
10 hypoglycemia. Grave's disease, goiter, Cushing's disease, Addison's disease, glucose-galactose 

malabsorption syndrome, hypercholesterolemia, adrenoleukodystrophy. Zellweger syndrome, Menkes 
disease, occipital horn syndrome, von Gierke disease, cystinuria, uninoglycinuria, Hartup disease, and 
Fanconi disease; a neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, 
cerebral neoplasms, Alzhdmer's disease. Pick's disease, Huntington's disease, dementia. Parkinson's 
15 disease and otho" extrapyramidal disorders, amyotrophic lateral sclerosis and otiier motor neuron 
disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple 
sderosis and other dmyelinating diseases, bacterial and viral meningitis, brain abscess, subdural 
empyema, epidural abscess, suppurative inJracranial tiirombophlebitis, myelitis and radiculitis, viral 
central nervous system disease, prion diseases mcluding kuru, Creutzfeldt-Jakob disease, and 
20 Gerslmann-Straussler-Schehiker syndrome, fatal familial insomnia, nutiitional and metaboUc diseases 
of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encqjhalotrigemima syndrome, mental retardation and otiier developmental disorders of tiie central 
nervous system induding Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 
nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dyshophy and otiier 
25 neuromuscular disordfaB. peripheral nervous system disorders, dermatomyositis and polymyositis, 
inherited, metabolic, aidocrine. and toxic myopathies, myasflienia gravis, periodic paralysis, mental 
disorders induding mood, anxiety, and sdiizophrenic disord^s. seasonal affective disorder (SAD), 
akathesia. amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, 
posthetpetic neuralgia, Tourette's disorder, progressive supranudear palsy, corticobasal degeneration. 
30 and familial fiontotemporal dementia; a musde disorder sudi as cardiomyqiathy. myocarditis, 
Dudienne's muscular dystrophy. Becker's muscular dystrophy, myotonic dystrophy, central core 
disease, nemaline myopathy, centronudearmyqpathy, lipid myopatiiy. mitodiondrial myopathy, 
infectious myositis, polymyositis, dermatomyositis, indusion body myositis, tiiyrotoxic myopafliy, 
ethanol myopathy, anghia, anaphylactic shock, airhyflmiias. asthma, cardiovascular shodc. Cushing's 
35 syndrome, hypertension, hypoglycemia, myocardial hrfarction, migrahie, pheodiromocytxima. and 
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myopatbies mcluding encephalopathy, epilepsy. Keams-Sayre syndrome, lactic acidosis, myoclonic 
disorder, ophthalmoplegia, and acid maltase deficiency (AMD. also known as Pompe's disease); and 
an immunological disorder such as acquired immunodeficiency syndrome (AIDS). Addison's disease, 
adultrespiratory distress syndrome, allergies, ankylosing spondylitis, amyloidods, anemia, asthma, 
atherosclerosis, autoimmune hemolyttc anemia, autohnmune thyroiditis, autoimmone 
polyendocrinopathy-candidiasls-ectodeimal dystrophy (APECED). bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis. diabetes mdlitns. emphysema, 
episodic lymphopenia wlthlymphocytotoxms. erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, HasWmoto's 
thyroiditis, hypereosuiophma. Irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, panaeatitis. polymyositis, 
psoriasis. Reiter's syndrome, rhemnatold arthritis, scleroderma, Sjogren's syndrome, systemic 
anaphylaxis, systemiclupos erythematosus, systemic sclerosis, ti^rombocytopenic purpura, ulcerative 
colitis, uveitis. Wema: syndrome, complications of cancer, hemodialysis, and extracorporeal 
circulation, viral, bacterial, fungal, parasitic, protozoal, and hehninflric mfections. and tramna. The 
polynucleotide sequences encodtog TRICH may be used In Southern or northern analysis, dot blot, or 
other memhrane-basedtechnfdogies;taPCR technologies; in dipstick, pin, andmultiformatELISA-like 
assays; and in microairays utilizing fluids or tissues from patients to detect altered TRICH expressloa 
Such qualitativeor quantitative mefliDds are wdlknownmthe art. 

In a particular aspect, tiie nudeotide sequences encoding TRICH may be useful in assays tiiat 
detecttiiepresenceofassoclateddisordfi«.particularlythDsemaitionedabove. Thenucleotide 

sequences encoduig TRICH may be labded by standard methods and added to a fluid or tissue san?.le 
fromapatientundet conditions suitable fortiieformation of hybridizationcompleKes. After a suitable 

incubation period, the sample is washed and flie signal is quantified and compared wiOi a standard 
value. Iftiieamountofsignalinthepatientsampleissignificanttyalteredmcon5)arisontoacort^^ 
san?)letiimthepresence of altaedlevds of nudeotide sequences eocodi^ 
indicates tiie presm:e of the assodated disorder. Sudi assays may also be used to evjauate ti^ 

of aparticulartiier8q)eutictreattnait regimen In animid studies, in dinicaltriids, or to ^ 

treatmait of an individual patient 

In order to provideabasis for tiK diagnosis ofadisorderassodatedwifli expression of TRICH. 

anormalorstandardprofileforexpressionisestablished. This may be accomplished by combhiing 
body fluids or cdl extracts taken from normal subjects, dtiier anmud or hmnan. with a seqoeoce, or a 
fragment tiiereof , encoding TRICH, under conditions suitable for hybridization or amplification. 
Standard hybridizationmay be quantifledby comparing tiie values obtained fromnormalsubj^ 
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values from an expeiitmA in wbich a knovm amount of a sabstantiaHy porified polynudeotide is iisesd. 
Standard values obtained in this manner may be compared with values obtained from samples frran 
patients wlio are symptomatic for a disorder. Desviation from standard values is used to establish the 
preseoce of a disorda:. 

5 Once the presence of a disorder is estabUshed and a treatment protocol is initiated, 

hybridization assays may be repeated on a regular basis to detennineif thelevd of expression hi the 
patient begins to approximate that which is observed in the normal subject. The results obtahied from 
successive assays may be used to show the efficacy of treatmaitover a period ranging from several 
daystomonflis. 

10 "With respect to cancer, the presMice of an abnormal amount of transcript (either under- or 

overexpressed) mbippsied tissue from an individual may indicate a predisposition for the devdopmoit 
of the disease, or may provide a means fijr detecting the disease prior to the appearance of actual 
dhiical synqrtoms. Amore dative diagno^ of this typemay aUow health professionals to employ 
preventative measures or aggressive treatment earlier thereby preventhig the devdopment or furttier 
15 progression of the cancer. 

Additional (fiagnostic uses for oligonucleotides designed fromlhe sequraces oicoding TRICH 
mayhivolvetheuseofPCR. These dligomffs may be diemically synthesized, generated oizymatically. 
or produced injgtto. Oligomers will preferably contain a fragment of a polynucleotide encodmg 
TRICH, or a fragment of a polynudeotide complementary to the polynucleolide encoding TRICH, and 
20 win be en5)loyed under c^rthnized conditions for identification of a specific ga^ 

OUgomers may also be anployed under less stringent conditions for detection or quantification of 
dosdy rdated DNA or RNA sequfioces, 

maparticular aspect, oligonndeotide primers derived from the polynudeotide sequences 
raicoding TRICH may be used to detect smgle nudeotidepdymorphisms (SNPs). SNPs are 
25 substitutions, insertions and ddetions that are a frequent cause of uiherited or acquired genetic disease 
in humans. Methods of SNP detection mdude, but are not limited to, shigle-stranded conftirmatian 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonacleotide primers 
derived from the polynucleotide sequences mcoding TRICH are used to anqjlify DNA ushig the 
polymerase chamreactton (PGR). The DNA may be derived, for example, from diseased or normal 
30 tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences inihe secondary 
and tertiary structures of PGR products m single-stranded form, and these differences are detectaWe 
ushig gd dectrophoresis in non-denaturing gds. In fSCCP, the oligonucleotide primers are 
fluorescenfly labded, whidi allows detection of the amplimers m high-throughput equipment such as 
DNA sequendng madnnes. Additionally, sequence database analysis methods, termed in siUco SNP 
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(isSNP) are capable of identifying polymorpMsms by comparing the sequence of individual 
overlapping DNA ftagments wbich assemble into a common consensus sequence. Tbese can5,uter. 
basedm^filteroutsequencevariationsduetolaboratorypreparationofDNAa^ 
e^otsusingstatisticalmoddsandautomatedanalysesofDNAsequencecbromatogra^^ mibe 
5 alteoiative.SNPsmaybedetectedandc^acterizedbymassspect^^^ 

UirouriroutMASSARRAY system (Sequeoom, Inc., San Diego CA). 

MettK)ds>^dimayalsobeu^toquantifytl.expressionofl^CH^^^^^ 

biotinylating nucleotides, coamplification of a control nuddc acid, and interpolating results from 
standard carves. (See, eg.. Mdby. P.C. et al. (1993) 1. Imn^^L Methods 159:235-244; Duplaa C. et 
10 al(1993)And.Biochm.212:229-236.)ThespeedafquantitationofmuMplesm^^^ 
acxderatedby running theassayinaWgl.1i.ougbp«tformat>vto^ 
interest ispresentedin various dilutions andaspectropbotom^corcolo^^ 

quantitation. 

to ftrtte «d»aa«ls, <«gomKa««to at 

c.nbeus»lintta„scriptlmaElBgt«a«d,a«wlianKrt»«»«ta^ 

.umbers Of ge.es slMUltaneou^y as acscribolb^w. T«Mcr<»„rayn«y*>be.«dtoltaa.y 
ge«dcvarlaMs.m.taflo.s.a»lpolymorpUsms. ras WOr«lio«n»yte«sedto<««*»ge..« 

20 p™gressio./regression<>rdisea»asafu««o.<*se.e«?-»^o...-*'o*^^ 

.cavlUesoftoape«icage«sln««ttea.«tof*ease. to particular. «slp*«a.i<».n«,y be «scd 
„devdopapbar«coge«a»cproffleofapa«e«to<rte.tos<iecttbem«.ppr«^ 
„ea«>ettreg».e.ft.matpa.e.t Fo, sample, ««ape»ttcage«s««cha«higW«ft<«vea.a 
,,^Uya»fcwestsideeffec,.mybese.ec«c.forapatte*basedo.Usteplw»c^ 
25 to.«)tt«eoJKKlin«..TIUOT.fragme„.sofTWCH,«artbodi.sspeclficfaTOCHm^ 
»e,s=dasae»«so.an>icroaiay. TteMcroa,ray my be used to «»Dit« or measure proteto- 
prrtei. Irter^aons, drug-target u«eraetiom, aud gene expressloaproffles. as descdW -»v^ 

Aparticularembodlmefflrelatestoteuseoftlepolynucleotides otliepres«tta«fflio«to 
ge^raeaBa-Bcnptlmageorattssueorcettwe. A»auscrtp.toager,^tt«globalpa«emof 
30 ge^expresstabyapartcularttssueorceatyp^ Global ge» expression patterns are analyzed by 
«^fl«mmto«e.^ge«s..>dtb*rdaaveabm^ 
giventime. (SeeSerihamere.a,X<».,»ativeQmena.«alptAnalysl^^^^ 
5840484 e^ytocorporaiedbyrArenceteta.) Tte a trama*. image may be generated by 
,^Mfflzi.«,tepoIym.de«idesrftbeprese«liwe«ionor«*a^^ 
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transcripts or reverse txanscripts of a particular tissue or ccfl type. In one emUodiment, the 
hybridization takes place inbigh-throughput format, wherein the polynucleotides of thepresent 
invention or their complements comprise a subset of a plnraUty of dements on a miaoairay. Tbe 
resultant transcript image would provide a proffle of gene activity. 

5 TransCTipt unages may be generated usmg transcripts isolated from tissues, cdl lines, Irippsies, 

or oflier biological ssaaples. Tbe transaipt image may thus rrflect gene expression jn^jvo, as inthe 
case of a tissue or Wopsy sample, or in vitro , as ui the case of a cell luie. ( 

■ftanscript images wMch proffle the expression of the polynucleotides of the present invention 
may also be used m conjunction with isLSaSQ model systems and preclinical evaluation of 

10 phamaceaticals,aswcfflastoxicologicaltestingofhidustrialandnatur^^^^ 

compounds. All corapoands induce diaractetistic gaie expression patterns, frequently termed 
molecular fingetprhrts or toxicant signatures, whidi are indicative of mechanisms of action and toxicity 
(Nuwaysir. E.F. et al. (1999) Mdl. Cardnog. 24:153-159; Sldner. S. and NX. Anderson (2000) 
ToxicoL Lett 112-113:467-471. expressly incorporated by reference herein). If a test compound has a 

15 signature similar to that of a compound with known toxicity, it is liMy t» share those toxic properties, 
•mesefingeqjrints or signatures are most useM and refined when they contamexprfessicmuiformation 
fromalargennmber of genes and genefamiUes. ideaDy. a genome-wide measurement of expression 
provides the highest quality signature. Even genes whose ejqiression is not altered by any tested 
ooB^Kxinds arein?)ortant as wdl, as theleveOs of expression of these genes are used to normalizethe 

20 rest of the expression data, ihe normalization procedure is useM for CQn?)arisan of expression data 
after treatment v(dth different conpounds. Whiletheassignmentof gene function to ea^nents of a 
toxicant signature aids in hitapretation of toxicity medumisms. la^ 

necessary for the statistical matchtag of signatures which leads to prediction of toxidty. (See, for 
example. Press Release 00-02 from flie National Institute of Environmental Health Sciences, released 
25 February 29, 2000. available at http://www.nidis.nih.gov/oc/newsAoxcbipJitm.) Therefore, it is 
important and desirable in toxicological screenmg ushig toxicant signatures to hidude an expressed 
goie sequences. 

mone embodiment, tiie toxicity of atest compound is assessed by treatingahiological sample 
contauung nucleic acids with the test compound. Nucleic acids that are expressed in flie treated 
30 biological sample are hybridized with one or more probes specific to the polynudeotides of the 
present invention, so tiiat franscript levels correspondmg to tiie polynucleotides of the present 
hivention may be quantified. The transcript levds in tiie treated biological sample are compared with 
levds M an untreated biological sample. Difi^erences m tiie transcript levds between tiie two samples 
are indicative of a toxic response caused by the test compound ui tiie treated sanq)le. 
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Anottier particular embodiment relates to the use of flie polypeptide sequences of the preseait 
invention to analyze the proteome of atissueor cell type. The term proteome refers to the global 
pattern of protein expression in a particular tissue or ceill type. Each protein component of a proteome 
can be subjected individually to further analysis. Proteome expression patterns, or proffles, are 
5 analyzed by quantifying the number of expressed proteins and tiidr relative abundance under given 
conditions and at a given time. A profile of a ceH's proteome may tiius be gaierated by separating and 
analyzmg flie polypeptides of a particular tissue or cell type. In one embodiment, tiie separation is 
acMeved using two-dimensional gd dectrophoresis, in which protons from a sample are separated by 
isodectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl 
10 sulfate slab gd dec(rq[taesis in the second dunaision (Stdner and Anderson, giffira). The proteins are 
visualized in the gd as discrete and uniqudy positioned spots, typically by staining the gd wifli an agent 
such as CoomassieBlue or sflvei or fluorescait stains. The optical density of each protein spot is 
generally prqportionsil to the levd of theprotdn inihe sample. The optical densities of equivaleoJly 
positioned protein spots from different samples, for example, from biological samples either treated or 
15 untreated wifli a test conpjund or flierapeutic agent, are CGnspai&i to itoitify any dianges m protdn 
spot density rdated to the treatmeoL Theprbtdnsintiie spots arepartiallysequfflicedu^, for 
example, standard methods employing diranical or enzymatic deavage followed by mass spectrometry. 
The identity of tiie protdn in a spot may be determined by comparing its partial sequence, preferably of 
at least 5 contiguous amino add residues, to tiie polypeptide sequences of tiie present invention. In 

20 some cases, further sequeaice data may be obtained for definitive i»otdn identificatioa 

A proteomic profUe may also be generated using antibodies spedfic for TRICH to quantify the 
levds of TRICH ^ressioa In one onbodunent, tiie antibodies are used as demeaits on a midroarray, 
and protein expression levels are quantified by exposing tiie microarray to flie sample and deteddng the 
levds of protdn bound to each array element (Lueking, A et al. (1999) AnaL Biodiem. 270:103-111; 

25 Mendoze, L.G. d al. (1999) Biotechniques 27:778-788). Detection may be performed by a variety of 
metiiods Icnown in tiie art, for example, by reacting tiie protdns in tiie sample witii a tiiiol- or amino- 
reactive fluorescent compound and detecting tiie amount of fluorescence bound at each array element. 

Toxicant signatures at tiie proteome levd are also useful for toxicological screening, and should 
be analyzed in paralld witii toxicant signatures at the transcript level. Thoe is a poor correlation 

30 between ti-anscript and protean abundances for s(Hne protdns in some tissues (Andorson, Ni. and J. 
Sdlhama- (1997) Eledn^oresis 18:533-537), so proteome toxicant signatures may be useful in tiie 
analysis of can?)ounds wMdi do not significantiy affea tiie transcript image, but which alter tiie 
proteomic profile. In addition, tiie analysis of tiransoipts mbody fluids is difficult, duetorajdd 
degradation of mRNA, so proteomic profiling may be more rdiable and informative fa sudi cases . 
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In another embodimmt, the toxidtyofatestcompomKi is assessedby^ 
sairqjlecontamiiigprotdBS with the test compound. Protdns that are expressed in the Heated biolokical 
san5,le are separated so that the amount of eachprotdncanbequanttfied. The an30unttf^ 
is compared to the amount of the corresponding protein in an untreated biological sample Adifference 
inthe amount of protein between the two san„les is indicative of a toxic response to the test compound 
inthetreatedsaim)le. individual proteins are identified by sequencing the amino acid residues of the 
individual proteins and comparing these partial sequences to the polypeptides of the present inventioa 

In another embodhnent, the toxicity of a test compound is assessed by treating a biological 
samplecontainingproteinswithlhetest compound. Proteins from the biological sample are incubated 
10 withantibodiesspecifictolhepolypeptidesaflhepresentinvention. The amount of protein recognized 
by the antibodies is quantified, amount of protein in the treated biological sample is compared with 
theamountuianuntreatedbiologicalsan^le. Adi£fera>cemihe amount of protein between the two 
san5)les is indicative ofatoxic response to thetestcompoandhilhe treated saii5)le^ 

Microarraysmaybeprepared.used.andanalyzedusingmethDdsknownhitheart (See.e.g.. 

15 Bremian. T.M. et al. (1995) U.S. PatentNo. 5.474.796; Schena, M. e* al. (1996) Proc. Natl. Acad. Sd. 
USA'93:10614.10619; Baldeschwefler et al. (1995) PCT application W095/25 1116; Shalon. D: et al. 
(1995) PCT ^plication WO95/35505; Hdler. R.A et al. (1997) Proc. NatL Acad. So. USA 94:2150- 
2155; andHeIler.M.J.etal. (1997) U.S.PatentNo.5,605.662.) Various types of microarrays are wdl 
known and thoroughly described mDNM^irrmrr^ys- A PracHq^l Approadi. M. Sdiena. ed. (1999) 
20 OxfordUniversityPress,London.herd)yexpresSlyincorparatedbyreference^ 

In another mbodfanent of the invention, nuddc add sequences «icoding TRICH may beused 
togeneratehybridizationprobesusefulinmappmgthenaturattyoccurringg^^ Either 
coding or noncoding sequences may be used, and in some instances, noncodhig sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members 
25 of a multi-gene family may potentially cause undesiied aoss hybridization during dm>mosomal 
mapping. TTie sequences may be mapped to a particular daromosome. to a spedfic region of a 
diromosome. or to artificial chromosome construdlons. e.g.. hmnan artifidal diromosomes (HACs). 
yeast artifidal diromosomes (YACs), bacterial artifidal diromosomes (BACs). baderial PI 
constiudions. or smgle diromosome cDNA libraries. (See, e.g., Harrington. J.J. et al. (1997) Nat 
30 Genet 15:345-355; Price. C.M. (1993) BloodRev. 7:127-134; andTras^B.!. (1991) TV«^ Genet 
7-149-154.) Once mapped, the nucldc addsequaices of tiie invention may be used to devdop genetic 
linkage maps, for examide.wWdi correlate the udieiitanceofadiseasestat^ 
particular diromosome region or resttidion fragment length polymorphism (RFLP). (See. for 
exaim)le. Lander. E.S. andD. Botstdn (1986) Proc. Nafl. Acad. Sd. USA 83:7353-7357.) 
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HuOTescentm^hybri(Hzation(FISH)inaybecQrrdate^ 
data. (See. e.g.. Heinz-Ulrich. et aL (1995) in Meyers, ssEra. pp. 965-968.) Exanq)les of genetic map 
(tota can be fomid in various scientmc journals or at the OnliiieMenddia^ 
World Wide Web site. Corrdation between the location of the gene encoding TRICH oa a physical 
map and a specific disorder, or a predisposition to a specific disorder, may help deflnether^on of 
DNA associated with that disorder and thus may further positional cloning efforts. 

msita hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may beused for extending genetic maps. 
Often the pLacement of a gene on the chromosome of another mammaUan species, such as mouse, may 
reveal associated markers evenlf theeatact diromosomal locus is not knowa This information is 
valuable to hivestigators searching fat disease genes using positional cloning or otha gene discovery 
tfidmiques. Once the gene or genes responsible for a disease or syndrome have been crudely localized 
by genetic linkage to aparticular genomic region, e.g., ataria-tdangiectasia to llq22-23, any sequences 
msq)ping to that area may represent associated or regulatory genes for fiirther mvestigatioa (See, e.g., 
Gatti. R.A. et aL (1988) Nature 336:577-580.) The nucleotide sequence of the instant invention may 
also be used to detect differences in the chromosomal location due to translocation, inversion, etc., 
among normal^ carrier, or affected individuals. 

In another embodiment of thehivaitiQn, TRICH. its catalytic or immunogenic firagments, or 
oligopeptides thereof can be used for screening Ubraries of compounds m any of a variety of drug 
screening techniques. The fragment employed in such screening may be firee in sohition, affixedtoa 
solid support, borne on a cdlsurfeccorlocatedinlracellularly. The formation of binding complexes 
brtweMi TRICH and the agent beang tested may be measured. 

Another technique for drug saeenhig provides for high throughput screenhig of compounds 
having suitable binding affinity to the protein of interest (See, e.g.. Geysen. et al. (1984) PCT 
appUcationWO84/03564.) inthismethod.largennmbersof different smaH test compounds are 
synthesized on a soUd substrate. The test compounds are reacted with TRICH. or fragments thereof, 
and washed. Bound TRICH is then detected by methods wdl known in the art: Purified TRICH can 
also be coated directly onto plates for use in the aforanentioned drug screening techniques. 
Altemativeiy. non-neutralizing antibodies can be used to capture the pq)tide and hmnobilize it ma 

solid si^ort 

In another embodunent, one may use conq)etitive drug screening assays m which neutralizmg 
antibodies capable of binding TRICH specifically compete with a test compound for bmding TRICH. 
In this manner, antibodies can beused to detect the presence of any peptide which shares one or more 

antigenic determinants with TRICH. 
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m additional anbodimasts . tbe raideotide sequences wWdi encode TRICH may be iised in any 
molecular biology techmques that have yettobe developed, provided Ihen^ 
properties of nucleotide sequences that are cmwHy laxmu including, but not limited to. such 
properties as the triplet genetic code and spedflc base pair interacttons. 
5 Withoutfurmerelaboration,itisbeUevedthatonesmedintheartcan.usingthepreceding 

description.utilizethepresentinventiontoitsfunestextent The foDowhig embodiments are. 
therefore, to be constnied as merely inustrative, and not limitative of the remainder of the ^^^^ 

in any way whatsoeva:. 

The disclosures of ail patmts, applications and publications, mentioned above and bdow, in 
10 particular U.S. Ser. No. 60/172,000. U.S. Set. No. 60/176,083. U.S. Ser. No. 60/177.332, U.S. Ser. 
No. 60/178.572. U.S. Ser. No. 60/179.758. and U.S. Ser. No. 60/181.625. are expressly incorporated 
by rdFerence herein. 

EXAMPLES 

15 L Construction of dDNA Libraries 

mcytecDNAs were derived from cDNAUhraries described in the LIFESEQ GOLD database 
(IncyteGen(Mnics.PaloAltoCA)andshowninTable4. columns. Some tissues were homogenized 
and lysed in guanidinium isothiocyanate. while others were homogenized and lysed in phenol or in a 
suitable mixtureofdenaturants. suchasTREOL (Ufe Technologies), a monophasic solution of phenol 

20 andguanidineisolhiocyanate. H» resulting lysates were centrifuged over CsQ cushions or extracted 
witiicUoroform. RNA was predpitatedfitmtiJelysates with either isopr(5>an(fl or soffiu^ 

^hand. or by ottier routine mi^bods. 

Phenol eirtractton and prec^itation of RNA were repeated as necessary to inaease RNA 
purity. Insomecases. RNA was treated witiiDNasft For most libraries. pQly(A)+ RNA was isolated 
25 using ohgod(T)KX)upled paramagnetic particles (Promega).OUGOTEX latex p^^^ 

ChatsworthCA).oranOUGarEXmRNApurificatiQnldt(QIACffiN). Allmiativeiy.RNAwas 
isolated direcfly from tissue lysates usmg oflier RNAisdafionldts. ag.. thePOLY(A)PURE mRNA 

purification kit (Ambion. Austhi TX). 

In some cases, Stratagene was provided with RNA and constructed the correspondhig cDNA 
30 libraries. Otherwise, cDN A was syntiiesized and cDNAHbraries were constructed with tiieUNEAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Tedmologies). using the 
recommendedprocedures or similar methods known inthe art. (See. e.g.. Ausubd. 1997. suEra. units 
5.1-6.6.) Reverse ti-anscription was ujitiated using oUgo d(T) or random primers. Synthetic 
digonodeotide adaptas were ligated to double stranded d)NA, and tiie cDNA was digested with tiie 
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appropriaterestriction enzymeor enzymes. For most libraries. IhecDNA^vas size-sdected (300-1000 
bp) using SEPHACRYL SIOOO, SEPHAROSE CL2B. or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or prq)arative agarose gel electrophoresis. cDNAs 
wereligatedinto compatible restriction enzyme sites of ihepolylinker of a suitable plasmid, e.g.. 
PBLUESCRIPT plasmid (Siratagene). PSPORTl plasmid (Life Technologies), PCDNA2.1 plasmid 
(tavifrogen. Carlsbad CA). or pINCY ancyte Genomics. Palo Alto CA). Recombinant plasmids 
iransfonnediitfo competentE^ cdls induding XLl-Blue. XLl-BlueMRF. o^ 
Stratageneor DHScPHlOB. orHecttoMAXDHlOB from life Technologies. 

n. Isolation «f cDNA Qones 

Plasmids obtained as described in Example I were recovered from host cdls by in^dyQ excision 
„singtheUNIZAPvectorsystem(Stratagene)orbyceniysis. Plasmids were purified using at least 
oneoflhefQllowing: a Magic or WIZARD MiniprepsDNA purification system (Promega); anAGTC 
Miniprq, purification kit (Edge Biosystems. Gaittersburg MD); and QIAWELL 8 Plasmid, QIAWELL 
8 Plus Hasmid, QIAWELL 8 Ultra Plasmid purification systems or tiie R.E AL. PREP 96 plasmid 
purificationldtfromQIACffiN. Followtogprecipitation.plasmidswereresuspendedin0.1mlof 
distilled water and stored, witii or wilhout lyqphilization, at 4'C. 

AlteniativeJy. plasmid DN A was amplified from host cell lysales using direct link PCR in a 
high-throughput format (Rao. V.B. (1994) AnaL Bioch^ 216:1-14). Host cdl lysis and tiiermal 
cycling stqjswerecarriedoutinasdnglereactionmixtoe. Samples were processed and stored in 384- 
wdl plates, and the conc«rtration of amplifiedplasmidDNA was quantified fluoromefrically using 
PICOGREEN dye (Molecular Probes. Eugene OR) and a FLUOROSKAN U fluorescence seamier 
(Labsystems Oy, Helsinki, Finland). 
III. Sequencing and Analysis 

Incyte cDNA recovered hi plasmids as described in Example H were sequenced as follows. 
Sequenchig reactions w^e processed usmg standard methods or high-flnoughput instrumentation 
such as die ABI CATALYST 800 (PE Biosystems) tiiermal cycler or tixe PTC-200 tiiermal cycler (MJ 
Research) m conjmiction witii tiie HYDRA microdispenser (Robbins Scientific) or tiie MICROLAB 
2200 (Hamilton) hquid transfer system. cDN A sequencing reactions were prepared using reagents- 
provided by Ameidiam Pharmacia Biotech or suppUed ui ABI sequencmg kits such as.tiie ABI 
) PRiSMBIGDYET€iminatorcyclesequencingreadyreactionkit(PEBiosystems). Electrpphoretic 
separation of cDNA sequencing reactions and detection of labeled polynucleotides were carried oat 
usmglheMEGABACE 1000 DNA sequenchig system (Molecular Dynamics); tiie ABI PRISM 373 or 
377 sequencing system (PE Biosyst«ns) hi conjmiction witii standard ABI protocols and base callmg 
software; or ofliersequenceanalysissystemsknownhitiieart Reading frames wifliin tiie cDNA 
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sequraices were identified using standard mesthods (reviewed in Ausubei, 1997, unit 7.7). Some 
of tiie cDNA sequences were sdected for extaisiCBi using the techniques disclosed in Example vm. 

nie polynucleotide sequences derived from Incyte cDNAs were validated by removmg vector, 
linker, and poly(A) sequences and by masking ambiguous bases, using algorittuns and programs based 

5 (m BLAST, dynamic programming, and dinudeotide nearest ndghbor analysis. Tlie Incyte cDNA 
sequences or translations thereof were thm queried against a sdection of pubUc databases such as the 
GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and BLOCKS, PRINTS, 
DOMO, PRODOM, and hidden Marlcov modd (HMM)-based protein family databases such as PFAM. 
(HMM is a probabilistic approach which analyzes consensus primary structures of gene families. 

10 See, for example, Eddy, S.R. (1996) Curr. Opin. Struct Biol. 6:361-365.) The queries were 
performfid using programs based on BLAST, FASTA, BLIMPS, and HMMR. The Incyte cDNA 
sequences were assembled to produce full length polynucleotide sequences. Alternatively, GenBanlc 
cDNAs, GenBank ESTs, stitched sequences, stretched sequences, or Genscan-predicted coding 
sequeances (see Examples IV and V) were used to extend Incyte cDNA assemblages to full length. 

15 Assembly was performed using programs based on Phred, Phrap, and Consed. and cDNA assemblages 
were scremed for open reading frames using programs based on GeneMark, BLAST, and FASTA. 
The fiffl length pdynucleotide sequences were translated to daive the corresponding full lengtli 
polypeptide sequences which were subsequtaiQy analyzed by querying against databases such as the 
GenBankprotein databases (genpqjt), SwissProt, BLOCKS, PRINTS. DOMO, PRODOM, Prosite, 

20 and Mddm Markov niDdd(HMM)-based proton family databases such as PFAM. Full length 
pdynndeotide sequences are also analyzed using MACDNASIS PRO software (Hitachi Software 
Engineering. South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and 
polypeptide sequence alignments are generated using default parameters specified by the CLUSTAL 
algorithm as hicoqxirated mto the MEGALIGN mnltisequence alignment program (DNASTAR). whidi 

25 also calculates the percent identity betweoi aligned sequences. 

Table 7 summarizes the tools, programs, and algorittuns used for tiie analysis and assembly of 
Incyte dDNA and full length sequences and provides applicable descriptions, references, and threshold 
parameters. The first column of Table 7 shows the tools, programs, and algorithms used, the second 
column provides brief descriptions tiiereof , the third column presents appropriate references, aU of 

30 which are incorporated by reference herdn in Ihdr entirety, and the fourth column presents, where 

applicable, the scores, probability values, and other parameters used to evaluate the strengfli of a matdi 
between two sequences (the higho- the score or tiie lower tiie probabiUty value, ttie greater tiie identity 
betweaitwo sequences). 

The programs desaibed above for tiie assembly and analysis of fiill lengtii polynudeotide and 
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polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:28-54. Fragmrats from about 20 to about 4000 nucleotides which are useful inhybridizatiou and 
ampMcation technologies are described in Table 4, column 4. 

IV. Identification and Editing of Coding Sequences firom Genomic DNA 

5 Putative transporters and ion channels ware initially identified by running the Genscan gene 

identification program against pubUc genomic sequeaoce databases (e.g., gbpri and gbhtg), Genscan is a 
general-purpose gene identification program which analyzes genomic DNA sequences from a variety of 
organisms (See Burge, C. and S. Karlin (1997) J. MoL Biol. 268:78-94, and Surge, C. and S, Karlin 
(1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to form an 

10 assembled cDNA sequence extending from a m^onine to a stop codon. The output of Genscan is a 
FASTA database of polynucleotide and polypeptide sequences. The maximum range of sequence for 
Gaiscan to analyze at once was set to 30 kb. To detmnine which of tiiese Geoscan predicted cDNA 
sequences encode transporters and ion channels, tiie encoded polypq)tides w^e analyzed by querying 
against PFAM models for transporters and ion channels. Potential transporters and ion channds were 

15 also identified by homology to Incyte cDNA seqa^ices tiiat had be^ annotated as transporters and ion 
channels. These selected G^can-predicted sequences were tiien compared by BLAST analysis to the 
gei^ept and gbpri public databases. Where necessary, ttje Genscan-predicted sequaices ware ihssi 
edited by con5)arison to the top BLAST hit from genpept to correct errors in tiie sequence predicted by 
Genscan, such as extra or omitted exons. BLAST analysis was also used to find any Incyte cDNA or 

20 public cDNA coverage of ttie Gaiscan-predicted sequmces, tinis providing evid^ice for transcription. 
WhMi Incyte cDNA coverage was available, this information was used to correct or confirm the 
Gaiscan predicted sequaice. Fulllengflipolynucieotidesequaices ware obtained by assembling 
Geaiscan-predicted coding sequences witii Incyte cDNA sequences and/or public cDNA sequences usmg 
flie assembly process described in Example ffl. Alternatively, full length polynucleotide sequences were 

25 derived endrdy from edited or unedited Genscan-predicted coding sequences. 

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
"Stitched** Sequences 

Partial cDNA sequmces were extaided with exons predicted by the Gooscan gene idoitification 
program described in Exan?)le IV. Partial cDNAs assembled as described in Example HI were mapped 
30 to genomic DNA and parsed into clusters containmg related cDNAs and Gaiscan exon predictions from 
one or more genomic sequences. Each cluster was analyzed using an algorittim based on graph tiieory 
and dynamic programming to integrate cDN A and genomic information, gena-ating possible splice 
variants that were subsequenfly confirmed, edited, or extended to create a fiiU lengtii sequence. 
Sequence intervals in which flie entire laigtti of ttie intoval was present on more flian one sequence in 
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the duster were identified, and intervals thus identified were considered to be equivalent by transitivity. 
For example, if an interval was present on a cDNA and two genomic sequences, then an three intervals 
wereconsideredtobeequivalent. -IMsprocess allows unrdated but consecutive genomic sequences to 
be brought together, bridged by cDNA sequence. Intervals thus identified were then "stitched" togete 
by the stitching algorithm in the order that they appear along their parent sequences to generate the 
longest possible sequence, as wdl as sequence variants. Linkages between intervals which proceed 
along one type of parent sequence (cDNA to cDNA or genomic sequence to genomic sequence) were 
given preference over linkages which change parent type (cDNA to genomic sequence). Hie resultant 
stitched sequences were translated and compared by BLAST analysis to the genpept and gbpri public 
databases. Incorrect exons predicted by Genscan were corrected by comparison to the top BLAST hit 
from genpept. Sequences were further extended with additional cDNA sequences, or by inspection of 
genomic DNA, necessary. 
"Stretched" Sequences 

Partial DNA sequences were extended to full length with an algorithm based on BLAST 
analysis. First, partial cDNAs assembled as described in Example HI were queried against public 
databases such as the GenBank primate, rodent, mammalian^ vertebrate, and eukaryote databases using 
the BLAST program. The nearest GenBank protein homolog was then compared by BLAST analysis 
to either hicyte cDNA sequences or GenScan exon predicted sequences described in Example IV. A 
chimeric protdn was generated by usmg the resultant high-scoring segment pairs (HSPs) to map the 
translated sequences onto the GenBank protein homolog. Insertions or ddetions may occur in the 
chimericproteinwithrespecttotheoriginalGenBankproteinhaniolog. The GenBank protdn homolog. 
the chimeric protein, or both were used as probes to search for homologous genomic sequences from the 
public human genome databases. Partial DNA sequences were therefore "stretched" or extended by the 
additionofhomologous genomic sequences. The resultant stretched sequences were examined to 
deterndne wh^er it contained a con?)l«e gene. 
VI. Chromosomal Mapping of TRICH Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:28-54 were compared with 
sequences from the Incyte UFESEQ database and pubUc domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:28-54 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiationhybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC). Whitehead Institute for 
Genome Research (WIGR), and G6n6thon were used to determine if any of tiie clustered sequences 
had been previously mapped. Inclusion of a mapped sequence hi a cluster resulted in the assigmnent 
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Of an sequences of that cluster, including its particular SEQ ID NO:, to that location. 

Map locations are represented by ranges, or intervals, or human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the tenninus of the chromosome' s p- 
arm. (The centiMorgan (cM) is a unit of measuremfflU based on recombination fiequendes between 

5 chromosomal markers. On average, 1 (Mis roughly equivalentto 1 megabase (Mb) of DNA in 
humans, although this can vary widdy due to hot and cold spots of recombination.) The cM 
distances are based on genetic markers mapped by G6n6flion which provide boundaries for raxfiation 
hybrid markers whose sequences were included in each of the dnsters. Human genome maps and 
other resources available to the public, such as the NCBI "GeneMap'99" Wodd Wide WeJ) site 

10 (http://www.ncbi.nlm.mh.gov/genemap/), can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 
VII. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 
and involves the hybridization of a labeled nucleotide sequence to a membrane on whidi KNAs from a 

15 particular cell type or tissue have been bound (See, e.g., Samhrook. su^a, ch. 7; Ausubd (1995) 
supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
mudi fester tiian multiple membrane-based hybridizations. In addition, flie sensitivity of tiie computer 
20 search can be modified to determine whether any particular match is categorized as exact <x simflar. 
The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 



25 



5 X minunum {l€aigfli(Seq. 1), length(Seq, 2)} 



The product score takes rato account both flie degree of shnUarity between two sequences and flie lengtti 
of the sequence match. The product score is a normalized value between 0 and 100, and is calculated 
as follows: tiie BLAST score is mnltipUed by tiie percent nudeotide identity and ttie product is divided 
by (5 times tiie lengtii of tiie shorter of tiie two sequences). The BLAST score is calculated by 
30 assigning a score of +5 for every base tiiat matches in a high-scoring segment pair (HSP), and -4 for 
every mismatdi. Two sequences may share more tiian one HSP (separated by gaps). If ttiereismQre 
tiianone HSP, tiim tiie pair wifli tiie highest BLAST score is used to calculate tiie product score. The 
product score represents a balance between fractional avetlap and quality in a BLAST alignment For 
tanqjle, a produd score of 100 is produced only for 100% identity over tiie entire lengtii of tiie shorter 
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erf the two sequeaces bdng compared. A product score of 70 is produced either by 100% idmtity and 
70% overlap at one aid, or by 88% identity and 100% overlap at the other. A product score of 50 is 
produced dther by 100% idaitity and 50% ovarlap at one md. or 79% idmtity and 100% ovari^; 
Alternatively, polynucleotide sequmces encoding TRICH are analyzed with respect to the 

5 tissue sources from >?^ch they were derived. For exanq)le, sonie full Imgth sequences are assembled, 
at least in part, with overlapping IncytecDNA sequences (see Example III), Each cDNA sequence is 
dmved from a cDNA library constructed from a human tissue. Each human tissue is classified into one 
of the following organ/tissue categories: cardiovascular system; connective tissue; digestive system; 
mbryonic structures; endocrine system; exocrine glands; genitaUa, female; genitalia, male; germ cells; 

10 hemic and immune system; liver; musculoskeletal syst^; nervous systrai; pancreas; respiratory 

system; sense organs; skin; stomatognathic system; unclassified/mixed; or urinary tract The numbo: of 
libraries in each category is counted and divided by the total number of libraries aaoss an categories. 
Similarly, each human tissue is classified into one of the following disease/condition categories: canco:, 
cdl line, developmental, inflammation, neurological, trauma, cardiovascular, pooled, and other, and ttie 

15 numba: of libraries in each category is counted and divided by the total number of libraries across all 
categories. The resulting parcentages reflect the tissue- and disease-specific expression of cDNA 
encoding TRICH. cDNA sequmces and cDNA library/tissue information are found in tb& LBRESEQ 
GOLD database Oocyte G«iomics, Palo Alto CA). 
VIII, Extension of TRICH Encoding Polynucleotides 

20 Full lengfli polynucleotide sequmces were also produced by extension of an apprqpriate 

fragmmt of the full lengtti molecule using oligonucleotide primars designed from this fragmoit One 
primer was synthesized to initiate 5 ' ext^ion of tiie known fr agmrat, and tiie otha: primer was 
synthesized to initiate 3' extension of tiie known fragment. The initial primars were designed using 
OLIGO 4.06 software (National Biosdences). ot anotiio: appropriate program, to be about 22 to 30 

25 nucleotides in lengtti, to have a GC content of about 50% or more, and to anneal to flie target sequence 
at temperatures of about 68 "C to about IT C, Any sfretch of nucleotides which would result in hairpin 
structures and prhnor-prima: dimaizations was avoided 

Selected human cDNA libraries ware used to extend ttie sequence. If more flian one extension 
was necessary or desired, additional or nested sets of primers were designed. 

30 High fidelity amplification was obtained by PCR using mettiods wdl known in the art PGR 

was performed in 96-well plates usmg tiie PTC-200 tiiermal cycler (MJ Researdi, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction bufGa* containing Mg^% (^4)2804, 
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(life Technologies), and Pfii DNA polymerase (Stratagene), witii the following parameters for primer 
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pair PCI AandPCI B: Step 1: 94°C. 3 min; 2: 94°C. 15 sec; Stq) 3: 60"C. 1 min; Ste?) 4: 68'C, 
2 min; Stq) 5: Stq)S 2, 3, and 4 repeated 20 tiines; Stq) 6: 68»C, 5 min; Ste?) 7: storage at 4''C. mihe 
altemative. tbe parameters for primer pair T7 and SK+ were as follows: Step 1: 94«C, 3 min; St£p 2: 
94°C. 15 sec; Step 3: 57'C. 1 min; Step 4: 68'C. 2 min; Step 5: Stq)s 2. 3. and 4 rq)eated 20 times; 

5 Step 6: 68'C, 5 min; Step 7: storage at A'C. 

The concentration of DNA in each wdl was determined by dispensing 100 )d PICOGREEN 
quantitation reagent (0.25 % (v/v) PICOGREEN; Molecular Probes. Eugene OR) dissolved in IX TE 
and 0.5 ^1 of undiluted PGR product into each well of an opaque fluorimeter plate (Coming Costar, 
Acton MA), allowing the DNA to bind to me reagent. The plate was scanned in a Ftaoroskan n 

10 (Labsystems Oy, Hdsinki, Finland) to measure the fluorescence of the sample and to quantify the 

concaitration of DNA ASfAtolOfd aliquot of the reaction mixture was analyzed by electrophoresis 
on a 1 % agarose gd to determine which reactions were successful in extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested Willi CviJI cholaa virus endonuclease (Molecular Biology Research, Madison WI), and 

15 sonicatedQrshearedpriortordigationintopUC18vectQr(AmershamPharmaciaBiotech). For 

shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose 
gds. fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were 
rcligated ustag T4 ligase (New England Biolabs, Beverly MA) hito pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfa DNA polymerase (Stratagene) to fill-in restriction site overhangs, 

20 andtransfected into competently cdls. Transformed cdls were sdected on antibiotic-containing 
media, and individual colonies werepicked and cultured overnight at 37'"C in 384-wdl plates in LB/2x 
carb liquid media. 

The cells were lysed, and DNA vm amplified by PCR usmg Taq DNApolymerase (Amersham 
Pharmacia Biotech) and Pfo DNA polymerase (Stratagene) with the following parameters: Step 1: 

25 94»C. 3 min; Step 2: 94°C, 15 sec; Step 3: 60'C, 1 min; Stq) 4: 72°C, 2 min; Step 5: stqw 2, 3, and 4 
repeated 29 thnes; Step 6: 72»C, 5 min; Step 7: storage at 4«C. DNA was quantified by PICOGREEN 
reagent (Mdecular Probes) as described above. Samples with low DNA recoveries were rean?)lified 
using the same conditions as described above. San?>les were dfluted with 20% dimethysulfoxide (1:2, 
v/v), and sequenced using DYENAMIC energy transfer sequencing primers and tiie DYENAMIC 

30 DIRECT kit (Amersham Pharmacia Biotedi) or the ABI PRISM BIGDYE Terminator cyde 
sequaicing ready reaction kit (PE Biosystems). 

In like manner, full length polynucleotide sequences are verified using the above procedure or 
are used to obtain 5' regulatory sequences using the above procedure along with oligonudeotides 
designed for sudi edension, and an appropriate genomic library. 
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IX. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:28-54 are anployed to scre«i cDNAs, genomic 
DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base pairs, is 
specifically described, essentially the same procedure is used with larger nucleotide fragm^its. 

5 Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National 
Bioscimces) and labeled by combining 50 pmol of each oligomer, 250 fxCi of [y-^] adenosine 
triphosphate (Amersham Pharmacia Biotech), and T4 polynudeotide kinase (DuPont NEN, Boston 
MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 suparfine size 
exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing W counts per 

10 minute of the labeled probe is used in a typical membrane-based hybridization analysis of human 

genomic DNA digested with one of the following endonucleases: Ase I, Bgl H, Eco RI, Pst I, Xba I, or 
Pvu n (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 

15 hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room tempo-ature 
under conditions of up to, for example, 0. 1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
conoparedL 

X. Microarrays 

20 The linkage or synthesis of array dements upon a microarray can be achieved utilizing 

photolithography, piezoelectric printing (ink-jet printing. See, e.g., Baldeschwdler, supra ,\ mechanical 
microspotting technologies, and dmvatives thareof . The substrate in each of the aforementioned 
technologies should be uniform and solid with a non-porous surface (Sdiena (1999), supra\ Suggested 
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure 

25 analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a 
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be 
produced using available methods and machines wdl known to those of ordinary skill in the art and may 
contain any appropriate number of dements. (See, e.g., Schena, M. et al. (1995) Science 270:467-470; 
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 

30 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or ohgomers thereof may 
comprise the elements of the microarray. Fragments or oligom^s suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The array 
dements are hybridized with polynucleotides in a biological sample. The polynudeotides in the 
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biological san^le are conjugated to a fluorescent labd or otber molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological smple are removed, and a 
fluoresceoce scanner is used to detect hybridization at each array demmt Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The d^ee of 
S conq)lem^itarity and the relative abundance of each polynucleotide which hybridizes to an el^n^ on 
the mioroarray may be assessed. In one embodiment, miCToarray preparation and usage is described in 
detail below. 

Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
10 poly(A)^ RNA is purified using the oligo-(dT) cellulose method. Each polyCA)"*" RNA sample is 
reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/|al oligo-(dT) primer (21mer), IX first 
strand buffer, 0.03 units/iJl RNase inhibitor, 500 \M dATP, 500 pM dGTP, 500 mM dTTP, 40 \M 
dCTP, 40 iulM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng polyCA)"*^ RNA with 
15 GEMBRIGHT kits (Incyte). Specific control poly(A)^ RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37^ C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85° C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
20 (CLONTECH), Palo Alto CA) and after combining, bolh reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sanq)le is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 pi 5X SSaO.2% SDS. 
Microarrav Preparation 

25 Sequences of the present invention are used to generate array elements. Each array element is 

amplified from bacterial cells containing vectors with cloned cDNA insats. PCR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 
lig. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 

30 Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Coming) are cleaned by ulfrasound in 0.1 % SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Rroducts Corporation (VWR), West Chester PA), washed ^tensively in distiMed water, and 
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coated with 0.05% aminopropyl silane (Sigma) in 95% ettianol. Coated sUdes are cured in a 110°C 



oven. 



Array elements are applied to the coated glass substate using a procedure described in US 
Patent No. 5,807,522 , incorporated herein by reference. 1 [A of the array element DNA, at an average 
concentration of 1 00 ng/^l, is loaded into the open capfflary printing element by a high-speed robotic 
apparatus. Hie apparatus then deposits about 5 nl of array element sample per sUde. 

MicroairaysareUV-crosslinkedusingaSTOATALINKERUV-crosslinker (Stratagene). 

Microairays are washed at room temperature once in 0.2% SDS andlhree times in distilled wafer. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein hi phosphate 
buffered saline (PBS) (Tropix. Inc., Bedford MA) for 30 minutes at 60" C followed by washes in 
0.2% SDS and distilled water as before. 
Hybridization 

Hybridization reactions contain 9 pi of sample mixtore consisting of 0.2 ^g each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. THe sample 
mixture is heated to 65°C for 5 minutes and is aliquoted onto the microarray surface and covered with 
an 1.8 cm^ coversUp. THe arrays are transferred to a waterproof chamber having a cavity just sUghtly 
larger than a microscope slide. The chamber is kept at 1 00% humidity internally by the addition of 
140 Ml of 5X SSC ui a comer of the chamber. The chamber containing the arrays is incubated for 
about 6.5 hours at 60° C. Tbe arrays are washed for l0minat45<'Cinafirst wash buffer (IX SSC, 
0.1% SDS), three times for 10 nunutes each at 45" C in a second wash buffer (O.IX SSC). and dried. 
Detection 

Reports-labeled hybridization conq)lexes are detected with a microscope equipped with an 
mnovaTO mixed gas 10 Wlaser (Coh^nt. Inc., Santa QaraCA) capable of generating spectral lines 
at488mnforexcitationofCy3andat632mnforexcitationofCy5. Hie excitation laser light is 
focused on the array usmg a 20X microscope objective (Nikon. Inc., Melville NY). The sUde 
containing the array is placed on a computer-controlled X-Y stage on the miaoscope and raster- 
scanned past the objective. The 1.8 cmx 1.8 cm array used hi the present example is scanned with a 

resolution of 20 micrometers. 

In two separate scans. amhtedgasmultiUnelaser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems. Bridgewater NJ) conespondmg to the two fluorophores. Appropriate 
fflters positioned between the array and the photomuWpUertobes are used to filter the signals. The 

emission maxhna of the fluorophores used are 565 mn for Cy3 and 650 mn for Cy5. Each array is 
typically scanned twice, one scan per fluorophoreushig the ^jpropriatefiltm at the laser source, 
although the apparatos is capable of recording the spectra from both fluorophores sumiltaneously. 
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Tte sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentratioa A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
location to be correlated with a weight ratio of hybridizing species of 1 :100,000. When two samples 

5 from different sources (e.g., representing test and control cells), each labeled with a different 

fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentially 
expressed, the calibration is done by labding samples of the calibrating cDNA with the two 
fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultipli^ tube is digitized using a 12-bit RTI-835H analog-to-digital 

10 (A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
compute. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 

15 emission spectra) between the fluorophores using each fluorophore' s emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used 
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 

20 XL Complementary Polynucleotides 

Sequences complementary to the TRICH-encoding sequences, or any parts tiiereof, are used to 
detect, decrease, or mhibit expression of naturally occurring TRICH, Although use of oligonucleotides 
con5)rising from about 15 to 30 base pairs is described, essratially the same procedure is used with - 
smaller or with larger sequoice fragmsots. Appropriate oligonucleotides are designed using OLIGO 

25 4.06 software (National Biosdaaces) and tiie coding sequence of TRICH. To inhibit transcription, a 
con5)lementary oligonucleotide is designed from the most imique 5 ' sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complemmtary oligonucleotide is 
designed to prevent ribosomal bindmg to the TRICH-encodmg fr anscript 
XII. Expression of TRICH 

30 E3q)ression and purification of TRICH is achieved using bacterial or virus-based expression 

systems. For expression of TRICH in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promote: that directs high levds of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trp4ac (tac) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
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demaiL Recombinant vectors are transfonned into suitable bactearial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express TRICHupanin(taction\(«his(5^ i 
tbiogalactppyranoside (IPTG). Expression of TRICH in eulcaryotic cdls is achieved by infecting insect 
or mannnalian cell lines with recombinant AiitograpMca califomica nuclear polyhedrosis virus 

5 (AcMNPV), commonly known as baculovirus. Thendnesseatialpciyiiedringeneofbaculovirusis 
replaced with cDNA aicoding TRICH by dthar homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the stijong 
polyhedrin promoter drives high levels of cDNA transcriptioa Recombinant baculovirus is used to 
ijQfect .Spndn pterafrugiperda (Sf9) insect cells in most cases, or human hqjatocytes. in some cases. 

10 Infection of the lattw requires additional genetic modifications to baculovirus. (See Engelhard, E.K. et 
al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum GeneTher. 
7:1937-1945.) 

In most ejqpression systems, TRICH is synthesized as a fiasion protein with, e.g., glutathione S- 
traosferase (GST) or a peptide q)itopetag, such as FLAG ot 6-His, pemaitting rapid, single-step, 

15 affinity-based purification of recombinant fusion protein fl-om crude cell lysates. GST, a 26-ldlodalton 
enzyme from Schistosoma iaixMricum. aiaWes the purification of fusion proteins on immobilized 
ghitathione under conditions tiiat maintain protdn activity and antigsaiicity ( Amersham Pharmacia 
Biotech). Following purification, tiie GST moiety can be proteolytically deaved from TRICH at 
specifically engineored sites. FLAG, an 8-amino add pqrtide, enables hnnmnoafBnity purification 

20 ush«commerdany avaikblemonodonal and p(flydonalanti-FLAG antibodies (PEastmanKx^ 6- 
His, a stretch of sue consecutive histidine residues, aiables purification on m^-dieiate resins 
(QIAGEN). Methods for protdneaqiression and purification are discussed hi Ausubei (1995, stt^ 
di. 10 and 16). Purified TRICH obtained by these methods can be used directly in tt« assays shown m 
Exan?)les XVI, XVn, andXVm, whore applicable. 

25 XIII. Functional Assays 

TRICH function is assessed by expressing the sequences aicodmg TRICH at physiologically 
elevated levds in mammalian cell culture systems. cDNA is subdoned mto a mammafian expression 
vector containing a strong promoter that drives high levels of cDNA expressioa Vectors of dwdce 
mclude PCMV SPORT (Life Technologies) and PCR3. 1 (Invitrogen, Carlsbad CA), both of whldi 

30 contam tiie cytomegalovirus promoto". 5-10 fig of recombinant vector are transiently transfected mto a 
human ceiQ Ime, for example, an endothelial or hematopoietic cdl line, using dther hposome 
formulations or dectrppOTatioa 1-2 /^g of an additional plasmid contaimng sequences encoding a 
marker protdn are co-ti-ansfected. E}q)ression of a markff protdn provides a means to distinguish 
transfected cdls fixan nontransfected cdls and is a reJiable predictor of cDNA expression from tiie 
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recombinant vector. Marker proteins of choice include. e.g.. Green Fluorescent Protein (GFP; 
Clontech) CD64,oraCD64-GFPfusionprotdn. How cytwuetry (FCM). an automated, laser optics- 
based tecbnique. is used toidentify Ixansfected cdls expressing GFP or CD64-GFP and to evaluate the 
apoptoticstateofthecellsandoti^ercellularproperties. FCM detects and quantifies the uptake of 
fluorescent molecules that diagnose events preceding or coincident withcdl death. Tl.eseevents include 
changes innuclear DNAcontent as measuredby staining of DNAmthpropidium iodide; changes m 
cdlsizeandgranularityasmeasuredbyforwardUghtscatterandPOdegreesidelightscatter;^ 

regulationofDNA synthesis as measuredby decreaseinbromodeoxyuridineuptaker^^^^ 
expressionofcdl surface andintraceflularprotdns as measuredbyreactivityw^^ 
andatoadonsinplamamembranecompositionasmeasuredbythebindingoffluor^^^ 
AnnemVproteintothecdlsurface. Methods in flow cytometry are discussed in Ormerod.M.G. 

n Flow Cytometry. Oxford. New York NY. 

Theinfluence of TRICH on gene expression can be assessed using highly purified populations 

ofcdlstransfectedwithseqaencesencodingTlUCHanddtherCD64orCD^^^^ CDMand 
CD64-GFP aree^essed onlhe surface of transfected cdls andbindloconservedregions of hmnan 
ImmonoglobulinGagGyTransfectedceDsareeffidentlysqjaratedfromnontransf^ 

niagnetic beads coated with dther human IgG or antibody against CD64 (DYNAL. Lake Success NY). 
mRNAcanbepurifiedfiomthecdlsusingmethodswdlknownbythoseofskininiheart Expression 

of mRNA encodhig TRICH and other genes of interest can be analyzedby northern analysis or 

20 microarray techniques. 

XIV. Production of TRICH Specific Antibodies 

TRICH substantially purified ushigpolyacrylamide gd dectrophoresis (PAGE; see, e.g.. 
Harrington. MG. (1990) Mdhods En^l. 182:488-495). or other purification tedmiques. is usedto 
immunize rabbits and to produce antibodies using standard protocols. 

25 Alternatively, ttie TRICH amino add sequence is analyzed using LASERGENE software 

(DN ASTAR) to determine regions of high immunogenidty, and a corresponding digopqrtide is 
synthesized and used to raise antibodies by means knowntotiK)se of skill in the art. Mdhods for 
sdection of appropriate epitopes, such as those near theC-terminus or inhydrophilicr^ons arewdl 

describedinthe art. (See, e.g.. Ausubd. 1995, mm> ch- H-) 
30 Typically. oUgopeptides of about 15 residues in length are synthesized using an ABI 431 A 

peptide synthesize. (PE Biosystems) using FMOC chemistry and coupled to KLH (Sigma-Aldrid, St 
Louis MO) by reaction withN-maldmidobenzoyl-N-hydroxysucdnimide ester (MBS) to increase 
immunogenidty. (See. eg.. Ausubd. 1995. M-) Rabbits are immunized with the oUgopeptide-KLH 
complexincompld^Freund's adjuvant Resulting antisera are tested for antipeptide and anti-TRICH 
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activity by, for example, binding the pq)tide or TRICH to a substrate, blocking with 1 % BS A, reacting 
with rabbit antisera, washing, and reactuig with radio-iodinated goat anti-rabbit IgG. ' 

XV. Purification of Naturally Occurring TMCH Using Specific Antibodies 
Naturally occurring or recombinant TRICH is substantially purified by immunoaffinity 

5 chromatography using antibodies specific for TRICH. An immunoaffinity column is constructed by 
covalenlly coupling anti-TRICH antibody to an activated diromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). Afl^ the coupling, the resin isl 
blocked and washed according to the manTifacturer's instructions. ■ 
Media containing TRICH are passed ovat the immunoaffinity column, and the column is 
10 washed imder conditions that allow the preferential absorbance of TRICH (e.g., high ionic strength 
buflers in the presence of detagent). The column is eluted under conditions that disrupt 
antibody/TRICH binding (e.g., a buffa- of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and TRICH is collected. 

XVI. Identification of Molecules Which Interact witti TRICH 

15 Molecules which interact with TRICH may include transports: substrates, agonists or 

antagonists, modulatory proteins such as Gpy proteins (Reimann, supra) or protdns mvolved m TRICH 
localization or clustering such as MAGUKs (Craven, sunra) . TRICH, or biologically active firagments 
tho-eof, are labeled wilii ^^I Bolton-Hunter reagmt (See, e,g., Bolton A.E. and W.M. Hunter (1973) 
Biochem. J. 1 33:529-539.) Candidate molecules previously arrayed in the wells of a multi-wdl plate 

20 are incubated with the labeled TRICH, washed, and any wells witii labeled TRICH complex are 
assayed. Data obtained using differast concentrations of TRICH are used to calculate values for the 
number, affinity, and association of TRICH witti the candidate molecules. 

Alternatively, proteins that interact with TRICH are isolated using the yeast 2-hybrid system 
(Fields, S. and O. Song (1989) Nature 340:245-246). TRICH, or fragmeaits thereof, are expressed as 

25 fusion proteins with the DN A bindmg domain of Gal4 or lexA and potential interacting protems are 
expressed as fusion proteins with an activation domain. Into'actions between the TRICH fusion protein 
and the reconstitutes a transactivation function that is observed by expression of a reporter gene. Yeast 
2-hybrid systems are commercially available, and metiiods for use of the yeast 2-hybrid system witii ion 
diannel proteins are discussed in Niethammer, M. and M. Sh^g (1998, Meth. Enzymol. 293:104-122). 

30 TRICH may also be used in the PATHCALLING process (CuraGen Corp., New Havaa CT) 

which employs the yeast two-hybrid system in a high-throughput mann^ to determine all interactions 
between tiie proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patoit 
No. 6,057,101), 

Potential TRICH agonists or antagonists may be tested for activation or inhibition of TRICH 
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ion dbaimd activity using the assays described in section XVIII. 
XVn. Demonstration of TRICH Activity 

Ion channel activity of TRICH is demonstrated using an electrophysiological assay for ion 
conductance. TRICH can be expressed by transforming a mammalian cell line sudi as COS7, HeLa 

5 or CHO with a eukaryotic expression vector encoding TRICH. Eukaryotic ^ression vectors are 
commerciaUy available, and ttie techniques to introduce them into cdls are well known to those 
skiUed in the art. A second plasmid which expresses any one of a number of marker genes, such as B- 
galactosidase, is co-transformed into the ceUs to allow rapid identification of those cells which have 
taken up and expressed the foreign DNA. The cells are incubated for 48-72 hours after transformation 

10 under conditions appropriate for the cell line to allow expression and accumulation of TRICH and B- 
galactosidase. 

Transformed ceUs expressing 6-galactosidase are stamed blue when a suitable colorimetric 
subslrate is added to the culture media under conditions that are well known in the art. Stained ceUs 
are tested for differences m membrane conductance by electrophysiological techniques that are well 

15 known m ttie art Untransformed cells, and/or cells transformed with either vector sequences alone or 
6-galactosidase sequences alone, are used as controls and tested in parallel. Cells expressing TRICH 
win have high^ anion or cation conductance relative to control cells. Ttie contribution of TRICH to 
conductance can be confirmed by incubating the cells using antibodies specific for TRICH. The 
antibodies wffl buid to the extraceUular side of TRICH, thereby blocking the pore in the ion channel, 

20 and the associated conductance. 

AltOTiatively, ion channel activity of TRICH is measured as current flow across a TRICH- 
contahung Xenopus laevis oocyte membrane usmg the two-electrode voltage-clamp technique (Ishi et 
al., supra : Jegla, T. and L. Salkoff (1997) J. Neurosd. 17:32-44). TRICH is subcloned into an 
appropriate Xenopus oocyte expression vector, such as pBF, and 0.5-5 ng of mRNA is mjected mto 

25 mature stage IV oocytes. Injected oocytes are incubated at IS'C for 1-5 days. Inside-out 

macropatches are excised into an intracellular solution containing 1 16 mM K-gluconate, 4 mM KQ, 
and 10 mM Hepes ^H 7.2). The intracdlular solution Is supplemented witii varyhig concentrations 
of the TRICH mediator, such as cAMP, cGMP, or Ca*^ (in tixe form of CaQg), where appropriate. 
Electrode resistance is set at 2-5 MQ and electrodes are filled wifli the mtracellular solution lacking 

30 mediator. Experim^ are performed at room temperature from a holding potential of OmV. Voltage 
ramps (2.5 s) from -100 to 100 mV are acquired at a sampling frequency of 500 Hz. Current measured 
is proportional to the activity of TRICH m the assay. 

Transport activity of TRICH is assayed by measuring uptake of labded substrates mto 
Xenopus laevis oocytes. Oocytes at stages V and VI are injected witii TRICH mRNA (10 ng per 
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oocyte) and incubated for 3 days at 1 8°C in 0R2 roedinm (82^inM NaQ. 2.5 mM Ka ImM CaQ ^ 
ImM MgCla. InM Na^HPO^. 5 mM Hepes. 3.8 mM NkOH . SO^gM gectamydn. pH 7.8) to alKJw 
expression of TRICH. Oocytes are tben transferred to standard uptake medium (lOOmM NaCl, 2 mM 
KQ, ImM CaCl2. ImM MgCl^. 10 mM Hepesmis pH 7.5). Uptake of various substrates (e.g.. 
5 amino acids, sugars, drugs, ions, and neurotransmitters) is initiated by adding labeled substrate (e.g. 
radiolabeled with ^H. fluorescently labeled with rhpdamine, etc.) to the oocytes. After incubating for 30 
minutes, uptake is terminated by washing the oocytes three times m Na*-fi:ee medium, measuring tiie 
incQtporated label, and comparing with controls. TRICH activity is proportional to the levd of ! 

internalized labded substrate. 
10 ATPase activity associated with TRICH can be measured by hydrolysis of radiolabeled ATP- 

[Y-'2p], separation of the hydrolysis products by chromatographic methods, and quantitation of the 
recovered using a scintillation counter. The reaction mixture contains ATP-Iy-'^P] and varying 
amounts of TRICH in a suitable buffer incubated at 37''C for a suitable period of time. The reaction is 
terminatedby acid precipitation witiHricMoroacetic add and then neutralized with base, and an aliquot 
15 oftiiereactionmixtureis subjected to membrane or filler paper-based diromatography to sq)aral» the 
reactionproducts. The amount of '^Uberated is counted m a sdntillation counter. THeamountdf 
radioactivity recovered is proportional to the ATPase activity of TRICH in the assay. 
XVni. Identification of TRICH iV^onists and Ant^onists 

TRICH is expressed in a eukaryotic cell line aidi as CHO (CMnese Hanister Ovary) or HEK 
20 (Human Embryonic Kidney) 293. londiannel activity of tiie transformed cells is measuredinflie 
presence and absence of candidate agonists or antagonists. Ion diannd activity is assayed usmg patdi 
clampmethodswdlknowninttieartorasdescribedinExampleXVn. Alternatively. ion diannd 
activity is assayed using fluorescait tediniques that measure ion flux across tiie cdl monhrane 
(VeUcelebi, G. et al. (1999) MettL Enzymol. 294:20-47; West, M.R. and C.R. Molloy (1996) Anal. 
25 Biochem. 241:51-58). These assays may be adapted for high-throughput screening using microplates. 
Changes in internal ion concentration are measured using fluorescent dyes such as tiie Ca^ indicator 
Huo-4 AM. sodium-sensitive dyes sudi as SBH and sodium green, or tiie CI" indicator MQAE (all 
available from Molecular Probes) in combination witii tiie FLIPR fluorimetric plate reading system 
(Molecular Devices). In a more generic version of tins assay, changes m membrane potential caused by 
30 ionic flux across tiie plasma manhrane are measured using oxonyl dyes sudi as DiBAC4 (Molecular 
Probes). DiBAC4 equiUbrates between tiie esdracdlular solution and ceJlular sites according to ttie 
ceaiular membrane potential. The dye's fluorescence intensity is 20-fold greater when bound to 
hydrophobic inlracdlular sites, allowing detection of DiBAC* aitry into tiie cdl (Gonzalez. J.E. and 
P. A Negulescu (1998) Curr. Opin. BiotedmoL 9:624-631). Candidate agonists or antagonists may be 
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selected from known ion channel agonists or antagonists, peptide libraries, or conabinatorial diemical 
libraries. 



Various modifications and variations of the described methods and systeans of tiie invention wfll - 
5 be appareait to those skffled in the art without departing from the scope and spirit of the inv^on. 
Although the invention hias been described in connection with catain embodimoots, it should be 
understood that the invention as daimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invoation which are obvious 
to those skilled in molecular biology or related fidds are intmded to be within the scope of the following 
10 claims. 
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What is claimed is: 

( 

1. An isolated polypqpttde comprising an amino add sequence selected ftom the groiq) 
consisting of: 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, 

b) a naturally occorring amino add sequence having at least 90% sequence idaitily to an 
amino add sequence selected from the group consisting of SEQ ID NO:l-27, I 

c) abiologically active fragment of an amino add sequence selected from the group , 

consisting of SEQ ID NO:l-27, and 
10 d) an immunogenic fragment of an amino acid sequence selected from the group consisting 

of SEQ ID NO: 1-27. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO: 1- 

27. 

15 

3. An isolated polynudeotide aacoding a polypeptide of claim 1 . 

4. An isolated polynudeotide encoding a polypeptide of claim 2. 

20 5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 

NO:28-54. 

6. A recombinant polynudeotide comprising a promoter sequence operably linked to a 
polynudeotide of daim 3. 

25 

7. AcellfransformedwitharecombhiantpQlynncleotideofclaim6. 

8. A fransgenic organism comprising a recombinant polynucleotide of daim 6. 

30 9. A method for producing a polypqjtide of claim 1 , the method con5>rising: 

a) culturing a cell under conditions suitable for e}qaBssion of the polypq)tide, wherein said 
cell is fransformed wifli a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promote sequence operably Unked to a polynudeotide encoding the polyp^de of daim 
1, and 

35 b) recovering the polypeptide so expressed. 
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, Anisolatedaiitibody Which specmcally binds toapolypepttdeofdaiml. 



10, 



11. An 



isolated polynucleotide comprising a polynucleotide sequence selected from the 



5 



groiq) consisting of : ^mynnosKA 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54. 

b) anaturaUy occurring polynucleotide sequencehavmg atleast90% sequenceidentity to a 
polynudeotide sequence selected from the group consisting of SEQ ID NO:28-54. 

c) a pdynucleotide sequMice complementary to a), 

d) apolynndeotide sequence conq)lementary to b), and 
10 e) an RNA equivalent of a)-d). 

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynudeotide of daim 1 1. 



15 



20 



25 



13. Amelhodfor deteding a target polynudeotide in a sample, said target polynudeotide 

having a sequence of a polymideotide of daim 1 1 . the method comprismg: 

a) hybridizingthesamplewithaprobecon^risingatleast20 contiguous nucleotides 

con,,risingasequence complem^ to said targetpolynudeotideinti^e sample, and w 

spedficallyhybridizesto saidtargetpolynucleotide. under conditions whereby a hybridization 
complex is fonnedbetweensaid probe and saidtargetpolynudeotide or fragments to^^^^^^ 

b) detedingtiie presence or absenceofsaidhybridization complex, and. optionally.if 

present, ttie amount thoreof . 

14. Ameflxodof Claim 13. wherdntixe probe comprises at least 60 contiguous nucleotides. 

15. Amethodfordetedingatargetpolynudeotideinasample.saidtargdpolym^^^^ 
havingasequenceofapolynudeotide of dahnll.tiie method comprishig: 

a) amplifying saidtarg^polynucleotideorfragmenttiiereofusingpolymera^^ 

amplification, and ^ ♦ 

b) detecting thepresence or absence of said amplified targetpolynudeottde or fragment 

thereof, and, optionally, if present, the amount thereof. 

16. Acompositioncomprisinganeffectiveamomitof apolypqjtideof daiml anda 
pharmaceuticany acceptable excipient 



35 
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17. A composition of claim 16. wherein the polyp^de comprises an amino add sequence 
selected from the group consisting of SEQ ID NO:l-27. 

18. A method for treatmg a disease or condition associated with decreased expressionof 
5 functionalTRICH.comprisingadmimsteringtoapatientinneedofsuchtieato 

claim 16. 

I 

I 

19. A method for saeening a compomid for effectiveness as an agonist of a polypepti(^ of 

daim 1, the method comprising: 
10 a) exposingasamplecomprisingapolypeptideofclaimltoacompomid,and 

b) detecting agonist activity in the sample. 

20. Acomposition comprising an agonist compound identifiedbyamethod of claim 19 and 
a pharmaceutically acceptable exdpient 

21 A method for treating a disease or condition associated with decreased expression of 
functionalTRICH. comprismg administeruxg tx, apatientinneed of suchtreato^^^ 

daim 20. 

20 22. AmefliodfarsaeeningaconqKmdforeffectivenessasanantagonistofapolypeptide 

of daim 1, ttie mefliod comprising: 

a) exposmg asamplecomprishig apolypeptideof claim 1 to acompomid. and 

b) detecting antagonist activity in the sample. 

25 23. Acompositioncomprisinganantagonistcompoundidentifiedbyamethodofclaim22 

and apharmaceatically acceptable exdpirait 

24. A method for treating a disease or condition associated wilh overexpression of functional 
TOCH, comprising admhusteringtoapatientinneed of suditi^entacomposition of c^ 

30 . , • 

25. Amethodofscrearingforaconpjundthatspedficallybindstotiiepolyp^^ 

1, said method conpising the steps of: 

a) combmmglhepdypqrtideofdaimlmtiiatleastonetestcompomKlundiTSui^^^ 

conditions, and 

35 b) detectingbindingoftiiepolypeptideofdahnl1x)tiietestcon?)Ound.ther^^ 
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compound that specifically binds to the polypeptide of claim 1. 

26. AmethodofscreeningforacompoundthatmoduMestheactiA^yoflhepolypqrtideof 

claim 1, said method comprising: 
5 a) combiningthepolypeptideofclaimlwithatleastonetestcompoundunderconditions 

permissive for the activity of the polypeptide of claim 1, 

b) assesstagtheactivity of thepolypq>tideof claim 1 in the presence of the test compound, and 

c) comparingtheactivity ofthepolypeptideof claiml inthepresenceof thetestconqjomid 

vvithihe activity of thepdypejrtide of dahnlinthe absence of the tes^ 
10 theactivityofthepolypeptideofclatolinthepresenceofthetestcompomMlisindi^^^ 

confound that modulates the activity of the polypeptide of daim 1. 

27. Amethodforscreeningacompomidfor effectiveness inaltering expression ofat^^^ 
polynucleotide, wherein said target pcdynudeotidecon^jrises a se(paK» of dahn 5. the m^ 
15 con?)rising: 

a) exp<^ngasamplecomprising1hetargetpdynucleotidetoacompound.undert^^ 

suitable for the eaq)ression of the target polynucleotide, 

b) detecting altered expression of the targetpdynudeotide. and 

c) comparingtheexpressionofthetargetpolynudeotideinthepresenceofvaryingam^ 
20 the confound and in the absence of the compound. 

28. A method for assessing toxicity of atest compound, said method comprising: 

a) treating abiological sample containing nucldc acids xviththe test conqxjund; 

b) hybridizing the nucleic acids of the treated biological sample with a probe con^sing at 
25 least 20 contiguous nucleotides of a polynucleotide of claun 1 1 under conditions whereby a spedfic 

hybridization complex is formed between said probe and a target polynudeotide in the biological 
sample, said target polymicleotidecomprisingapolynucleotidesequenceofa^^^^ 

1 1 or fragmoit thereof; 

c) quantifying the amount of hybridization complex; and 

30 d) (x)n5,aiingtheamountofhybridizationcomplexinthetreatedbiologicalsamplewithfee 

amomrt of hybridization complex in an mitreated biological sample, wherein a difference in the 
amount of hybridization complex in the treated biological sample is indicative ^ 

compound. 
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<210> 1 

<211> 477 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc^feature 

<223> Incyte ID No: 1416107CD1 



Met^ihr Pro Glu Asp Pro Glu Glu Thr Gin Pro Leu Leu Gly Pro 
- 5 10 -^--^ 

Pro Gly Gly Ser Ala Pro Arg Gly Arg Arg Val Phe Leu Ala Ala 

Phe Ala Ala Ala Leu Gly Pro Leu Ser Phe Gly Phe Ala Leu Gly 

35 40 
Tyr ser Ser Pro Ala lie Pro Ser Leu Gin Arg Ala Ala Pro Pro 

50 

Ala Pro Arg Leu Asp Asp Ala Ala Ala Ser Trp Phe Gly Ala Val 

65 '0 
val Thr Leu Gly Ala Ala Ala Gly Gly Val Leu Gly Gly Trp Leu 

val Asp Arg Ala Gly Arg Lys Leu Ser Leu Leu Leu cys Ser Val 

Pro Phe Val Ala Gly Phe Ala Val He Thr Ala Ala Gin Asp Val 

110 

Trp Met Leu Leu Gly Gly Arg Leu Leu Thr Gly Leu Ala Cys Gly 

125 -^^^ 
val Ala ser Leu Val Ala Pro Val Tyr lie Ser Glu He Ala Tyr 

140 T 
Pro Ala val Arg Gly Leu Leu Gly Ser Cys Val Gin Leu Met Val 

155 T 
Val val Gly He Leu Leu Ala Tyr Leu Ala Gly Trp Val Leu Glu 

170 1. 
Trp Arg Trp Leu Ala Val Leu Gly Cys Val Pro Pro Ser Leu Met 
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pro Phe Ila He Gly Val Sar Leu Met Ala Phe Gin Gin Leu Ser 



Oly val Asn Ala HI Met Phe O^r Ala Glu He Phe Glu Glu 
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185 l^^ 
Leu Leu Leu Met cys Phe Met Pro Glu Thr Pro Arg Phe Leu Leu 

OJhr Gin His Arg l^l Gin Glu Ala Met Ala Ala Leu Arg Phe Leu 

Xrp Gly ser Glu Gin Gly Trp Glu Asp Pro Pro He Gly Ala Glu 

Gin ser Phe His fen Ala Leu Leu Arg Gin Pro Gly He Oyr Lys 

Val Ser Leu Met 

Met Phe Tyr Ala 

Ser Ser Leu Ala 

Thr Ala Val Ala 

Ala Gly Arg Arg Leu Leu Leu Val Leu Ser Gly Val Val Met Val 

Phe ser Thr Ser III Phe Gly Ala Tyr Phe Lys Leu Thr Gin Gly 

Gly pro Gly Asn Ser Ser His Val Ala He Ser Ala Pro Val Ser 

Ala Gin Pro Val lip Ala Ser Val Gly Leu Ala Trp Leu Ala Val ' 

aly ser Met Cys Leu Phe He Ala Gly Phe Ala Val Gly Trp Gly 



Ala Lys Phe Lys Sp Ser Ser Leu Ala Ser Val Val Val Gly Val 



280 



lie Gin val Leu Phe Thr Ala Val Ala Ala Leu He Met Asp Arg 



295 



Pro He pro Trp Leu Leu Met Ser Glu HJ Phe Pro Leu His Val 



Lys Gly val Ala ^r Gly He Cys Val Leu Thr Asn 1^ Leu Met 



400 



Ala Phe Leu Val J^r Lys Glu Phe Ser Ser Leu Met Glu Val Leu 



415 



Arg Pro Tyr Gly All Phe Trp Leu Ala Ser Ala Phe Cys He Phe 



430 



ser val Leu Phe Leu Phe cys Val Pro Glu Thr Lys Gly Lys 



445 



455 460 



Thr Leu Glu Gin He Thr Ala His Phe Glu Gly Arg 



470 ^'7^ 



<210> 2 

<211> 498 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> incyte ID No: 1682513CD1 



Se^Lg Arg Gin Asp Ser Arg Gly Asn 0^ Val Leu His Ala Leu 
Vai Ala He Ala AsJ Asn Thr Arg Glu Asn Thr Lys Phe Val Thr 
Lys Met Tyr Asp dn Leu Leu Leu Lys Cys Ala Arg Leu Phe Pro 
Asp ser Asn Leu gIu Ala Val Leu Asn Asn Asp Gly Leu Ser Pro 
Leu Met Met Ala A^a Lys Thr Gly Lys He Gly Asn Arg His Glu 



Met Leu Ala Val Glu Pro He Asn Glu Leu Leu Arg Asp Lys Trp 



Arg Lys Phe Gly Ala Val Ser Phe Tyr lie Asn Val Val Ser Tyr 

Thr 

110 ^ 
Leu Glu Gly Thr Pro Pro Tyr Pro Tyr Arg Thr Thr Val Asp Tyr 



Leu cys Ala Met Val He Phe S^i.r Leu Thr Ala Tyr Tyr Gin Pro 
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Leu Arg I^eu Ala Glu Val He Leu Phe Thr Gly Val iju 
Phe Phe Phe Thr HI He Lys Asp Leu Phe Met Lys Lys Cys Pro 
Gly val Asn Ser ieu Phe He Asp Gly Ser Phe Gin Leu Leu Tyr 
Phe lie ^ ser Z Leu Val He Val Ser Ala Ala Le^ ^ Leu 
Ala Gly lie Glu Tyr Leu Ala Val Met Val Phe Ala Leu Val 
Leu Cly Trp Met Ala Leu Tyr Phe Thr Arg Gly Leu Lys Leu 
Thr Gly ^ Tyr Ser He Met He Gin Lys lie Leu Phe Lys Asp 
Leu Phe Arg Phe feu Leu Val Tyr Leu Leu Phe Met lie Gly Tyr 
Ala ser Ala Leu vfl Ser Leu Leu Asn Pro cys Ala Asn Met Lys 
val cys Asn Gly J^P Gin Thr Asn Cys Thr Val Pro Thr Tyr Pro 
ser cys Arg Asp fer Glu Thr Phe Ser Thr Phe Leu Leu Asp Leu 
Phe Lys Leu Thr fie Gly Met Gly Asp Leu Glu Met Leu Ser Ser 
Thr Lys Tyr Pro Val Val Phe He He Leu Leu Val Thr .^yr He 
lie Leu Thr Phe III Leu Leu Leu Asn Met Leu He Ala Leu Met 
Gly Glu Thr val c'ly Gin Val Ser Lys Glu Ser Lys His He Trp 
Lys Leu Gin Trp fla Thr O^r He Leu Asp He Glu Arg Ser Phe 
pro val Phe Leu l^'g Lys Ser Phe Arg Ser Gly Glu Met Val Thr 



335 
Gly 
350 
Ala 
365 

Lvs Ser Phe Arg 

- 3 85 

val Gly Lys Ser HI Asp Gly Thr Pro Asp Arg Arg Trp Cys Phe 
Arg val Asp Glu Val Asn Trp Ser His Trp Asn Gin Asn Leu Gly 
lie He Asn Glu ^P Pro Gly Lys Asn Glu Thr Tyr Gin Tyr Tyr 
Gly Phe ser His SI Val Gly Arg Leu Arg Arg Asp Arg Trp ser 
ser val Val Pro ^g Val Val Glu Leu Asn Lys Asn ser Asn Pro 



At^c^ 

ASP Glu val val vll Pro Leu Asp Ser Thr Gly Asn Pro Arg Cys 
ASP Gly His Gin Gin Gly Tyr Pro Arg Lys Trp Arg Thr Asp Asp 
Ala Pro Leu 



<210> 3 

<211> 764 

<212> PRT 

<213> Homo sapiens 

<220> 



<221> misc_feature 
<223> Incyte ID No: 2446438CD1 

Strihr ser Pro Ser Ser Ser Pro Val Phe Arg Leu Glu Thr Leu 
ASP Ala Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg Gly Lys Leu 
ASP Phe Gly ser cly Leu Pro Pro Met Glu Ser Gin Phe Gin Gly 
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35 40 45 

Glu Asp Arg Lys Phe Ala Pro Gin lie Arg Val Asn Leu Asn Tyr 
.50 55 . oO 

Arg Lys Gly Thr Gly Ala Ser Gin Pro Asp Pro Asn Arg Phe Asp 
65 70 75 

Arg Asp Arg Leu Phe Asn Ala Val Ser Arg Gly Val Pro Glu Asp 
80 85 90 

Leu Ala Gly Leu Pro Glu Tyr Leu Ser Lys Thr Ser Lys Tyr Leu 
95 100 105 

Thr Asp Ser Glu Tyr Thr Glu Gly Ser Thr Gly Lys Thr Cys Leu 
110 115 120 

Met Lys Ala Val Leu Asn Leu Lys Asp Gly Val Asn Ala Cys lie 
125 130 135 

Leu Pro Leu Leu Gin He Asp Arg Asp Ser Gly Asn Pro Gin Pro 
140 145 150 

Leu Val Asn Ala Gin Cys Thr Asp Asp Tyr ayr* Arg Gly His Ser 
155 160 165 

Ala Leu His He Ala He Glu Lys Arg Ser Leu Gin cys Val Lys 
170 175 180 

Leu Leu Val Glu Asn Gly Ala Asn Val His Ala Arg Ala Cys Gly 
185 190 195 

Arg Phe Phe Gin Lys Gly Gin Gly Thr cys Phe Tyr Phe Gly Glu 
200 205 210 

Leu Pro Leu Ser Leu Ala Ala Cys Thr Lys Gin Trp Asp Val Val 
215 220 225 

Ser Tvr Leu Leu Glu Asn Pro His Gin Pro Ala Ser Leu Gin Ala 
230 235 240 

Thr Asp Ser cin Gly Asn Thr Val Leu His Ala Leu Val Met lie 
245 250 255 

Ser Asp Asn Ser Ala Glu Asn He Ala Leu Val Thr Ser Met Tyr 
260 265 270 

Asp Gly Leu Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin 
275 280 285 

Leu Glu Asp He Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu 
290 295 300 

Ala Ala Lys Glu Gly Lys He Glu He Phe Arg His He Leu Gin 
305 310 315 

Ara Glu Phe Ser Gly Leu Ser His Leu Ser Arg Lys Phe Thr Glu 
^ 320 325 330 

TriD CVS Tyr Gly Pro Val Arg Val Ser Leu Tyr Asp Leu Ala Ser 
335 340 345 

Val Asp Ser Cys Glu Glu Asn Ser Val Leu Glu He He Ala Phe 
350 355 360 

His cys Lys Ser Pro His Arg "His Arg Met Val Val Leu Glu Pro 
365 370 375 

Leu Asn Lys Leu Leu Gin Ala Lys Trp Asp Leu Leu He Pro Lys 
380 385 390 

Phe Phe Leu Asn Phe Leu Cys Asn Leu He Tyr Met Phe He Phe 
395 400 405 

Thr Ala Val Ala Tyr His Gin Pro Thr Leu Lys Lys Gin Ala Ala 
410 415 420 

Pro His Leu Lys Ala Glu Val Gly Asn Ser Met Leu Leu Thr Gly 
425 430 435 

His He Leu He Leu Leu Gly Gly He Tyr Leu Leu Val Gly Gin 
440 445 450 

Leu Trp Tyr Phe Trp Arg Arg His Val Phe He Trp He Ser Phe 
455 460 465 

He ASP Ser Tyr Phe Glu He Leu Phe Leu Phe Gin Ala Leu Leu 
470 475 480 

Thr Val Val Ser Gin Val Leu Cys Phe Leu Ala He Glu Trp Tyr 
485 490 495 

Leu Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn Leu 
500 505 510 

Leu Tyr Tyr Thr Arg Gly Phe Gin His Thr Gly He Tyr Ser Val 
515 520 525 

Met He Gin Lys Val He Leu Arg Asp Leu Leu Arg Phe Leu Leu 
530 535 540 
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lie Tyr Leu Val Phe Leu Phe Gly Phe Ala Val Ala Leu Val Ser 

545 550 
Leu ser Gin Glu Ala Trp Arg Pro Glu Ala Pro Thr Gly Pro Asn 

560 

Ala -rbx Glu ser Val Gin Pro Met Glu Gly Gin Glu Asp Glu Gly 

575 580 
Asn Gly Ala Gin Tyr Arg Gly He Leu Glu Ala Ser Leu Glu Leu 

590 595 . ouw 

Phe Lys Phe -Ehr He Gly Met Gly Glu Leu Ala Phe Gin Glu Gin 

605 ^-^^ 
Leu His Phe Arg Gly Met Val Leu Leu Leu Leu Leu Ala Tyr Val 

620 ^25 
Leu Leu Thr Tyr He Leu Leu Leu Asn Met Leu He Ala Leu Met 

635 

ser Glu -Thr Val Asn Ser Val Ala Thr Asp Ser Trp Ser He Trp 
650 

Lys Leu Gin Lys Ala He Ser Val Leu Glu Met Glu Asn Gly Tyr 
665 ? 
Trp Trp cys Arg Lys Lys Gin Arg Ala Gly Val Met Leu Thr Val 

680 

Gly Thr Lys Pro Asp Gly Ser Pro Asp Glu Arg Trp Cys Phe Arg 

695 

val Glu Glu Val Asn Trp Ala Ser Trp Glu Gin Thr Leu Pro Thr 

710 

Leu cys Glu ASP Pro Ser Gly Ala Gly Val Pro Arg Thr Leu Glu 
725 

Asn pro val Leu Ala Ser Pro Pro Lys Glu Asp Glu Asp Gly Ala 

740 745 
ser Glu Glu Asn Tyr Val Pro Val Gin Leu Leu Gin Ser Asn 

755 760 

<210> 4 

<211> 255 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 2817822CD1 

Me^'Jrp Gin Gly Cys Ala Val Glu Arg Pro Val Gly Arg Met Thr 

1 5 
ser Gin Thr Pro Leu Pro Gin Ser Pro Arg Pro Arg Arg Pro Thr 

Met ser Thr Val Val Glu Leu Asn Val Gly Gly Glu Phe His Thr 

Thr Thr Leu Gly T^r Leu Arg Lys Phe Pro Gly Ser Lys Leu Ala 

50 5^ 
Glu Met Phe ser Ser Leu Ala Lys Ala Ser Thr Asp Ala Glu Gly 



65 



Arg Phe Phe He Asp Arg Pro Ser Thr Tyr Phe Arg Pro He Leu 

85 •'^ 



gQ 

Asp Tyr Leu Arg Thr Gly Gin Val Pro Thr Gin His He Pro Glu 



Val Tyr Arg Glu All Gin Phe Tyr Glu He Lys Pro Leu Val Lys 



100 105 



110 — - 

Leu Leu Glu Asp Met Pro Gin He Phe Gly Glu Gin Val Ser Arg 



115 120 



225 

LYS Gin Phe Leu Leu Gin Val Pro Gly Tyr Ser Glu Asn Leu Glu 

140 145 
Leu Met Val Arg Leu Ala Arg Ala Glu Ala He Thr Ala Arg Lys 
155 -^"-^ 
ser ser Val Leu Val cys Leu Val Glu Thr Glu Glu Gin Asp Ala 

170 1"^^ 
Tyr Tyr Ser Glu Val Leu cys Phe Leu Gin Asp Lys Lys Met Phe 
ire; 190 



130 135 
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Lys Ser Val Val Lys Phe Gly Pro Trp Lys Ala Val Leu Asp Asn 
^ 200 205 210 

Ser ASP Leu Met His Cys Leu Glu Met Asp lie Lys Ala Gin Gly 

215 220 225 

Tvr Lvs Val Phe Ser Lys Phe Tyr Leu TThr Tyr Pro Thr Lys Arg 

230 235 240 

Asn Glu Phe His Phe Asn He Tyr Ser Phe Thr Phe Thr Trp Trp 

245 250 255 



<210> 5 

<211> 584 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4009329CD1 

<400> 5 

Met Ala Gly Arg Arg Leu Asn Leu Arg Trp Ala Leu Ser Val Leu 
1-5 10 15 

CVS Val Leu Leu Met Ala Glu Thr Val Ser Gly Thr Arg Gly Ser 

20 25 30 

Ser Thr Gly Ala His He Ser Pro Gin Phe Pro Ala Ser Gly Val 

35 40 45 

Asn Gin Thr Pro Val Val Asp cys Arg Lys Val cys Gly Leu Asn 

50 55 Y 

Val Ser Asp Arg Cys Asp Phe He Arg Thr Asn Pro Asp Cys His 

65 70 75 

^Ser Asp Gly Gly Tyr Leu Asp Tyr 

80 85 90 

Phe Pro Pro Ser Leu Leu Pro Leu Ala Val Thr Leu Tyr Val Ser 

95 100 105 

Trp Leu Leu Tyr Leu Phe Leu He Leu Gly Val Thr Ala Ala Lys 

110 115 J-20 

Phe Phe cys Pro Asn Leu Ser Ala He Ser Thr Thr Leu Lys Leu 

125 130 135 

Ser His Asn Val Ala Gly Val Thr Phe Leu Ala Phe Gly Asn Gly 

140 145 150 

Ala Pro Asp He Phe Ser Ala Leu Val Ala Phe Ser Asp Pro His 

155 160 165 

Thr Ala Gly Leu Ala Leu Gly Ala Leu Phe Gly Ala Gly Val Leu 

170 175 180 

Val Thr Thr Val Val Ala Gly Gly He Thr He Leu His Pro Phe 

185 190 195- 

Met Ala Ala Ser Arg Pro Phe Phe Arg Asp He Val Phe Tyr Met 

200 205 210 

Val Ala Val Phe Leu Thr Phe Leu Met Leu Phe Arg Gly Arg Val 

215 220 225 

Thr Leu Ala Trp Ala Leu Gly Tyr Leu Gly Leu Tyr Val Phe Tyr 

230 235 .240 

Val Val Thr Val He Leu Cys Thr Trp He Tyr Gin Arg Gin Arg 

245 250 255 

Arg Gly Ser Leu Phe Cys Pro Met Pro Val Thr Pro Glu He Leu 

260 265 270 

Ser Asp Ser Glu Glu Asp Arg Val Ser Ser Asn Thr Asn Ser Tyr 

275 280 285 

Asp Tyr Gly Asp Glu Tyr Arg Pro Leu Phe Phe Tyr Gin Glu Thr 

290 295 300 

Thr Ala Gin He Leu Val Arg Ala Leu Asn Pro Leu Asp Tyr Met 

305 310 315 

Lvs Trp Arg Arg Lys Ser Ala Tyr Trp Lys Ala Leu Lys Val Phe 

320 325 330 

Lys Leu Pro Val Glu Phe Leu Leu Leu Leu Thr Val Pro Val Val 
^ 335 340 345 

Asp pro Asp Lys Asp Asp Gin Asn Trp Lys Arg Pro Leu Asn Cys 
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350 



355 360 



Leu His Leu Val He Ser Pro Leu Val Val Val Leu Thr Leu Gin 
365 ' ^ T 

ser Gly Thr Tyr Gly Val Tyr Glu He Gly Gly Leu Val Pro Val 

380 -^oD 
Trp val val Val Val He Ala Gly Thr Ala Leu Ala Ser Val Thr 

Phe Phe Ala Thr III Asp Ser Gin Pro Pro Arg Leu His Trp Leu 

410 ^-^^ 1 

Phe Ala Phe Leu Gly Phe Leu Thr Ser Ala Leu Trp He Asn Ala 

Ala Ala Thr Glu Val Asn He Leu Arg Ser Leu Gly Val Val 

440 1 m 

Phe Arg Leu Ser Asn Thr Val Leu Gly Leu Thr Leu Leu Ala Trp 



Gly Asn ser He J!y Asp Ala Phe Ser A^p Phe Thr Leu Ala Arg 

470 T 
Gin Gly Tyr Pro Arg Met Ala Phe Ser Ala Cys Phe Gly Gly lie 

lie Phe Asn He til Val Gly Val Gly Leu Gly Cys Leu Leu Gin 
500 

He ser Arg Ser His Thr Glu Val Lys Leu Glu Pro Asp Gly Leu 

515 b^v 
Leu val Trp Val Leu Ala Gly Ala Leu Gly Leu Ser Leu Val Phe 

ser Leu Val Ser VaJ Pro Leu Gin Cys Phe Gin Leu Ser Arg Val 

545 n T 

Tyr Gly Phe Cys Leu Leu Leu Phe Tyr Leu Asn Phe Leu Val Val 

560 bob 
Ala Leu Leu lie Glu Phe Gly Val He His Leu Lys Ser Met 



<210> 6 

<211> 416 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> miscfeature 

<223> Incyte ID No: 6618083CD1 

M^J^Lys Leu ser Lys Lys Asp Arg Gly Glu Asp Glu Glu Ser Asp 

1 5 T 

ser Ala Lys Lys Lys Leu Asp Trp Ser Cys Ser Leu Leu Val Ala 
20 

ser Leu Ala Gly Ala Phe Gly Ser Ser Phe Leu Tyr Gly T/r Asn 



Leu ser Val Val A^n Ala Pro Thr Pro He Lys Ala Phe Tyr 



Asn Glu ser Trp Glu Arg Arg His Gly i^g Pro He Asp Pro Asp 



55 60 



■mr Leu Thr Leu Leu Trp Ser Val Thr Val Ser He Phe Ala He 
Gly Gly Leu Val g?J -Itor Leu He Val Lys Met He Gly Lys Val 

Leu Gly Arg Lys His Thr Leu Leu Ala Asn Asn Gly Phe Ala He 

110 lib 
ser Ala Ala Leu Leu Met Ala Cys Ser Leu Gin Ala Gly Ala Phe 

125 T 
Glu Met Leu He Val Gly Arg Phe He Met Gly He Asp Gly Gly 

140 "^^-^ 
val Ala Leu Ser Val Leu Pro Met Tyr Leu Ser Glu He Ser Pro 

155 n 

Lys Glu He Arg Gly Ser Leu Gly Gin Val Thr Ala He Phe He 



70 75 



cys He Gly Val Phe Thr Gly Gin Leu l4u Gly Leu Pro Glu Leu 
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185 190 
Leu Gly Lys Glu Ser Thr Trp Pro Tyr Leu Phe Gly Val lie Val 

200 205 , 210 , 

Val Pro Ala Val Val Gin Leu Leu Ser Leu Pro Phe Leu Pro Asp 

215 220 225 

ser Pro Arg Tyr Leu Leu Leu Glu Lys His Asn Glu Ala Arg Ala 

230 235 240 

Val Lys Ala Phe Gin Thr Phe Leu Gly Lys Ala Asp Val Ser Gin 

245 250 255 

Glu Val Glu Glu Val Leu Ala Glu Ser His Val Gin Arg Ser lie 

260 265 270 

Arg Leu Val Ser Val Leu Glu Leu Leu Arg Ala Pro Tyr Val Arg 

275 280 285 j 

Trp Gin Val Val Uhr Val He Val Thr Met Ala Cys Tyr Gin Leu , 

290 295 300 

cys Gly Leu Asn Ala He Trp Phe Tyr Thr Asri Ser He Phe Gly 
' 305 310 315 

Lys Ala Gly He Pro Leu Ala Lys He Pro Tyr Val Thr Leu Ser 

320 325 330 

Thr Gly Gly He Glu Thr Leu Ala Ala Val Phe Ser Gly Leu Val 

335 340 •^43 

He Glu His Leu Gly Arg Arg Pro Leu Leu He Gly Gly Phe Gly 

350 355 360 

Leu Met Gly Leu Phe Phe Gly Thr Leu Thr He Thr Leu Thr Leu 

365 370 375 

Gin Asp His Ala Pro Trp Val Pro Tyr Leu Ser He Val Gly He 

380 385 390 

Leu Ala He He Ala Ser Phe Cys Ser Gly Pro Ala Val Phe Pro 

395 400 405 

Glu Glu Thr Val Asn Val Ser He Val Ser Glu 
410 415 



<210> 7 

<211> 664 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<223> Incyte ID No: 7472002CD1 



Met°Tta Glu Lys Thr Asn Gly Val Lys Ser Ser Pro Ala Asn Asn 

15 10 
His Asn His His Ala Pro Pro Ala He Lys Ala Asn Gly Lys Asp 

20 25 30 

Asp His Arg Thr Ser Ser Arg Pro His Ser Ala Ala Asp Asp Asp 

35 40 
Thr Ser Ser Glu Leu Gin Arg Leu Ala Asp Val Asp Ala Pro Gin 

50 55 60 

Gin Gly Arg Ser Gly Phe Arg Arg He Val Arg Leu Val Gly lie 

65 70 "5 

He Arg Glu Trp Ala Asn Lys Asn Phe Arg Glu Glu Glu Pro Arg 

80 85 
Pro Asp ser Phe Leu Glu Arg Phe Arg Gly Pro Glu Leu Gin Thr 

Val Thr Thr Gin Glu Gly Asp Gly Lys Gly Asp Lys Asp Gly Glu 
110 115 -l-^" 

Asp Lys Gly Thr Lys Lys Lys Phe Glu Leu Phe Val Leu Asp Pro 

Ala Gly Asp Trp Tyr Tyr Cys Trp Leu Phe Val He Ala Met Pro 

140 145 
Val Leu Tyr Asn Trp Cys Leu Leu Val Ala Arg Ala cys Phe Ser 

155 160 lo= 

Asp Leu Gin Lys Gly Tyr Tyr Leu Val Trp Leu Val Leu Asp Tyr 

170 175 l''" 

Val Ser Asp Val Val Tyr He Ala Asp Leu Phe He Arg Leu Arg 
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185 190 195 

Thr GlY Phe Leu Glu Gin Gly Leu Leu Val Lys Asp Thr Lys Lys 

200 205 210 

Leu Arg Asp Asn Tyr He His Thr Leu Gin Phe Lys Leu Asp Val 

215 220 225 

Ala Ser He He Pro Thr Asp Leu He Tyr Phe Ala Val Asp He 

230 235 240 

His Ser Pro Glu Val Arg Phe Asn Arg Leu Leu His Phe Ala Arg 

245 250 255 

Met Phe Glu Phe Phe Asp Arg Thr Glu Thr Arg Thr Asn Tyr Pro 

260 265 2/0 

Asn He Phe Arg He Ser Asn Leu Val Leu Tyr He Leu Val He 

275 280 285 

He His Trp Asn Ala Cys He Tyr Tyr Ala He Ser Lys Ser He 

290 295 
Gly Phe Gly Val Asp Thr Trp Val Tyr Pro Asn He Thr Asp Pro 

Glu Tyr Gly Tyr Leu Ala Arg Glu Tyr He Tyr Cys Leu Tyr Trp 

Ser Thr Leu Thr Leu Thr Thr He Gly Glu Thr Pro Pro Pro Val 
335 340 345 

Lys Asp Glu Glu Tyr Leu Phe Val He Phe Asp Phe Leu He Gly 

350 355 3o0 

Val Leu He Phe Ala Thr He Val Gly Asn Val Gly Ser Met He 

365 370 3"5 

Ser Asn Met Asn Ala Thr Arg Ala Glu Phe Gin Ala Lys He Asp 

380 385 390 

Ala val Lys His Tyr Met Gin Phe Arg Lys Val Ser Lys Gly Met 

395 400 405 

Glu Ala Lys Val He Arg Trp Phe Asp Tyr Leu Trp Thr Asn Lys 

410 415 
Lys Thr Val Asp Glu Arg Glu He Leu Lys Asn Leu Pro Ala Lys 

425 430 435 

Leu Arg Ala Glu He Ala He Asn Val His Leu Ser Thr Leu Lys 

440 445 450 

Lys Val Arg He Phe His Asp Cys Glu Ala Gly Leu Leu Val Glu 

455 460 465 

Leu Val Leu Lys Leu Arg Pro Gin Val Phe Ser Pro Gly Asp Tyr 

470 475 
He cys Arg Lys Gly Asp He Gly Lys Glu Met Tyr He He Lys 

485 490 
Glu Gly Lys Leu Ala Val Val Ala Asp Asp Gly Val Thr Gin Tyr 

500 505 a-LU 

Ala Leu Leu Ser Ala Gly Ser Cys Phe Gly Glu He Ser He Leu 

515 520 525 

Asn He Lys Gly Ser Lys Met Gly Asn Arg Arg Thr Ala Asn He 

530 535 
Arg Ser Leu Gly Tyr ser Asp Leu Phe cys Leu Ser Lys Asp Asp 

Leu Met Glu Ala Val Thr Glu Tyr Pro Asp Ala Lys Lys Val Leu 

560 565 
Glu Glu Arg Gly Arg Glu He Leu Met Lys Glu Gly Leu Leu Asp 

575 580 585 

Glu Asn Glu Val Ala Thr Ser Met Glu Val Asp Val Qln Glu Lys 

590 595 oiJ" 

Leu Gly Gin Leu Glu Thr Asn Met Glu Thr Leu Tyr Ttor Arg Phe 

605 610 615 

Gly Arg Leu Leu Ala Glu Tyr Thr Gly Ala Gin Gin Lys Leu Lys 

620 625 630 

Gin Arg He Thr Val Leu Glu Thr Lys Met Lys Gin Asn Asn Glu 

635 640 645 

Asp Asp Tyr Leu Ser Asp Gly Met Asn Ser Pro Glu Leu Ala Ala 

650 655 660 

Ala Asp Glu Pro 



<210> 8 
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<211> 242 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc^feature 

<223> Incyte ID No: 1812692CD1 

<400> 8 ' ^ 

Met Ser Phe Arg Ala Ala Arg Leu Ser Met Arg Asn Arg Arg Asn 
15 10 15 

Asp Thr Leu Asp Ser Thr Arg Thr Leu Tyr Ser Ser Ala Ser Arg 

20 25 30 

Ser Thr Asp Leu Ser Tyr Ser Glu Ser Asp Leu Val Asn Phe lie 
35 40 45 

Gin Ala Asn Phe Lys Lys Arg Glu Cys Val Phe' Phe Thr Lys Asp 
SO 55 60 

Ser Lys Ala Thr Glu Asn Val Cys Lys Cys Gly Tyr Ala Gin Ser 
65 70 75 

Gin His Met Glu Gly Thr Gin lie Asn Gin Ser Glu Lys Trp Asn 
80 85 90 

Tyr Lys Lys His Thr Lys Glu Phe Pro Thr Asp Ala Phe Gly Asp 
95 100 105 

He Gin Phe Glu Thr Leu Gly Lys Lys Gly Lys Tyr He Arg Leu 

110 115 120 

Ser Cys Asp Thr Asp Ala Glu He Leu Tyr Glu Leu Leu Thr Gin 

125 130 135 

His Trp His Leu Lys Thr Pro Asn Leu Val He Ser Val Thr Gly 

140 145 150 

Gly Ala Lys Asn Phe Ala Leu Lys Pro Arg Met Arg Lys He Phe 

155 160 165 

Ser Arg Leu He Tyr He Ala Gin Ser Lys Gly Ala Trp He Leu 

170 175 180 

Thr Gly Gly Thr His Tyr Gly Leu Met Lys Tyr He Gly Glu Val 

185 190 195 

Val Arg Asp Asn Thr He Ser Arg Ser Ser Glu Glu Asn He Val 

200 205 210 

Ala He Gly He Ala Ala Trp Gly Met Val Ser Asn Arg Asp Thr 

215 220 225 

Leu He Arg Asn Cys Asp Ala Glu Val Pro Val Gly Gin Glu Glu 

230 235 240 

Val Cys 



<210> 9 

<211> 398 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3232992CD1 



<400> 9 












Gly Tyr Leu Gly Asp Arg Phe 


Asn 


Met 


Val 


Ala 


Ala 


Pro 


He 


Phe 


1 








5 






10 


15 


Arg 


Lys 


Val 


He 


Leu 


Ser 


cys 


Gly He Phe Phe Trp Ser Ala 


Val 






20 






25 


30 


Thr 


Phe 


Ser 


Ser 


Ser 


Phe 


He 


Pro Gin Gin Tyr Phe Trp Leu 


Leu 






35 






40 


45 


Val 


Leu 


Ser 


Arg 


Gly Leu 


Val 


Gly He Gly Glu Ala Ser Tyr 


Ser 








50 






55 


60 


Thr 


He 


Ala 


Pro 


Thr 


He 


He 


Gly Asp Leu Phe Thr Lys Asn 


Thr 






65 






70 


75 


Arg 


Thr 


Leu 


Met 


Leu 


Ser 


Val 


Phe Tyr Phe Ala He Pro Leu Gly 








80 






85 


yu 


Ser 


Gly 


Leu 


Gly 


Tyr 


He 


Thr 


Gly Ser Ser Val Lys Gin Ala 


Ala 
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95 100 105 

Gly Asp Trp His Trp Ala Leu Arg Val Ser Pro Val Leu Gly Met 
110 115 120 

He Thr Gly Thr Leu He Leu He Leu Val Pro Ala Ttor Lys Arg 

125 130 ^ 

Gly His Ala Asp Gin Leu Gly Asp Gin Leu Lys Ala Arg Thr Ser 

140 145 
Trp Leu Arg Asp Met Lys Ala Leu He Arg Asn Arg Ser Tyr .Val 
155 160 165 

Phe ser Ser Leu Ala Thr Ser Ala Val Ser Phe Ala Thr Gly Ala • 

170 175 180 

Leu Gly Met Trp He Pro Leu Tyr Leu His Arg Ala Gin Val Val 
185 190 195 

Gin Lys Thr Ala Glu Thr cys Asn Ser Pro Pro Cys Gly Ala Lys 
200 205 210 

Asp Ser Leu lie Phe Gly Ala He Thr Cys Phe Thr Gly Phe Leu 
*^ 215 220 22b 

Gly Val Val Thr Gly Ala Gly Ala Thr Arg Trp Cys Arg Leu Lys 

230 235 ^ , in 

Thr Gin Arg Ala Asp Pro Leu Val Cys Ala Val Gly Met Leu Gly 
245 250 255 

ser Ala He Phe He Cys Leu He Phe Val Ala Ala Lys Ser Ser 
260 265 270 

He Val Gly Ala Tyr He Cys He Phe Val Gly Glu Thr Leu Leu 
275 280 28b 

Phe ser Asn Trp Ala He Thr Ala Asp He Leu Met Tyr Val Val 

290 295 
He Pro Thr Arg Arg Ala Thr Ala Val Ala Leu Gin Ser Phe Thr 
305 310 315 

Ser His Leu Leu Gly Asp Ala Gly Ser Pro Tyr Leu He Gly Phe 
320 325 330 

He Ser Asp Leu He Arg Gin Ser Thr Lys Asp Ser Pro Leu Trp 
335 340 345 

Glu Phe Leu Ser Leu Gly Tyr Ala Leu Met Leu Cys Pro Phe Val 
350 355 360 

Val Val Leu Gly Gly Met Phe PKe Leu Ala Thr Ala Leu Phe Phe 
365 370 375 

Val ser Asp Arg Ala Arg Ala Glu Gin Gin Val Asn Gin Leu Ala 

jyu 



Met Pro Pro Ala Ser Val Lys Val 
395 

<210> 10 

<211> 553 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 33583 83CD1 

<400> 10 



Met^Ala^Phe Gin Asp Leu Leu Gly His Ala Gly Asp Leu Trp Arg 

1 5 10 

Phe Gin He Leu Gin Thr Val Phe Leu Ser He Phe Ala Val Ala 
20 25 30 

Thr Tyr Leu His Phe Met Leu Glu Asn Phe Thr Ala Phe He Pro 
35 40 45 

Gly His Arg Cys Trp Val His He Leu Asp Asn Asp Thr Val Ser 
50 55 °^ 

Asp Asn Asp Thr Gly Ala Leu Ser Gin Asp Ala Leu Leu Arg lie 
65 - 70 75 

Ser He Pro Leu Asp Ser Asn Met Arg Pro Glu Lys cys Arg Arg 
80 85 90 

Phe Val His Pro Gin Trp Gin Leu Leu His Leu Asn Gly Thr Phe 

95 100 
Pro Asn Thr Ser Asp Ala Asp Met Glu Pro Cys Val Asp Gly Trp 
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110 115 
Val Tvr Asp Arg He Ser Phe Ser Ser Thr He Val Thr Glu Trp 

125 130 ■'■^ I 

Asp Leu Val Cys Asp Ser Gin Ser Leu Thr Ser Val Ala Lys Phe 

140 145 
val Phe Met Ala Gly Met Met Val Gly Gly He Leu Gly Gly His 
155 160 lo5 

Leu ser Asp Arg Phe Gly Arg Arg Phe Val Leu Arg Trp Cys Tyr 
170 175 IfO 

Leu Gin Val Ala He Val Gly Thr Cys Ala Ala Leu Ala Pro Thr 
185 190 19 = 

Phe Leu He Tyr Cys Ser Leu Arg Phe Leu Ser Gly He Ala Ala 

200 205 210 j 

Met Ser Leu He Thr Asn Thr He Met Leu He Ala Glu Trp Ala 
215 220 225 

Thr His Airg Phe Gin Ala Met Gly He Thr Leu Gly Met Cys Pro 
230 235 ■^^^ 

Ser Gly He Ala Phe Met Thr Leu Ala Gly Leu Ala Phe Ala He 
245 250 255 

Arg Asp Trp His He Leu Gin Leu Val Val Ser Val Pro Tyr Phe 
260 265 270 

Val He Phe Leu Thr Ser Ser Trp Leu Leu Glu Ser Ala Arg Trp 
275 280 285 

Leu He He Asn Asn Lys Pro Glu Glu Gly Leu Lys Glu Leu Arg 
290 295 300 

Lys Ala Ala His Arg Ser Gly Met Lys Asn Ala Arg Asp Thr Leu 
305 310 315 

Thr Leu Glu He Leu Lys Ser Thr Met Lys Lys Glu Leu Glu Ala 
320 325 ^ 330 

Ala Gin Lys Lys Lys Pro Ser Leu Cys Glu Met Leu His Met Pro 
335 340 Jft^ 

Asn He Cys Lys Arg He Ser Leu Leu Ser Phe Thr Arg Phe Ala 
350 355 360 

Asn Phe Met Ala Tyr Phe Gly Leu Asn Leu His Val Gin His Leu 
365 370 = 

Gly Asn Asn Val Phe Leu Leu Gin Thr Leu Phe Gly Ala Val He 
380 385 390 

■ Leu Leu Ala Asn Cys Val Ala Pro Trp Ala Leu Lys Tyr Met Thr 
3 95 400 405 

Arg Arg Ala Ser Gin Met Arg Leu Met Tyr Leu Leu Ala He Cys 

410 415 
Phe Met Ala He He Phe Val Pro Gin Glu Met Gin Thr Leu Arg 
425 430 435 

Glu Val Leu Ala Thr Leu Gly Leu Gly Ala Ser Ala Leu Thr Asn 
440 445 ^->^ 

Thr LGU Ala Phe Ala His Gly Asn Glu Val He Pro Thr He lie 

455 460 
Arg Ala Arg Ala Met Gly He Asn Ala Thr Phe Ala Asn He Ala 

470 475 
Gly Ala Leu Ala Pro Leu Met Met He Leu Ser Val Tyr Ser Pro 

485 490 
Pro Leu Pro Trp He He Tyr Gly Val Phe Pro Phe He Ser Gly 
500 505 ^J-^ 

Phe Ala Phe Leu Leu Leu Pro Glu Thr Arg Asn Lys Pro Leu Phe 
515 520 525 

Asp Thr He Gin Asp Glu Lys Asn Glu Arg Lys Asp Pro Arg Glu 

Pro Lys Gin Glu Asp Pro Arg Val Glu Val Thr Gin Phe 
545 550 



<210> 11 

<211> 213 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 
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<223> Incyte ID No: 4250091CD1 

Met°ser'''ser Gin Glu Leu Val Thr Leu Asn Vai Gly Gly Lys lie 

15 10 
Phe Thr Thr Arg Phe Ser Thr He Lys Gin Phe Pro Ala Ser Arg 

20 25 30 

Leu Ala Arg Met Leu Asp Gly Arg Asp Gin Glu Phe Lys Met Val 

35 40 45 

Glv Gly Gin lie Phe Val Asp Arg Asp Gly Asp Leu Phe Ser Phe 

50 55 60 

He Leu Asp Phe Leu Arg Thr His Gin Leu Leu Leu Pro Thr Glu 
65 70 75 

Phe Ser Asp Tyr Leu Arg Leu Gin Arg Glu Ala Leu Phe Tyr Glu 
80 85 90 

Leu Arg Ser Leu Val Asp Leu Leu Asn Pro Tyr Leu Leu Gin Pro 
95 100 105 

Arg Pro Ala Leu Val Glu Val His Phe Leu ser Arg Asn Thr Gin 

110 115 120 

Ala Phe Phe Arg Val Phe Gly Ser Cys Ser Lys Thr He Glu Met 

125 130 135 

Leu Thr Gly Arg He Thr Val Phe Thr Glu Gin Pro Ser Ala Pro 

140 145 1^0 

Thr Trp Asn Gly Asn Phe Phe Pro Pro Gin Met Thr Leu Leu Pro 

155 160 165 

Leu Pro Pro Gin Arg Pro Ser Tyr His Asp Leu Val Phe Gin Cys 

170 175 ^ 180 

Gly Ser Asp Ser Thr Thr Asp Asn Gin Thr Gly Val Arg Tyr Phe 

185 190 195 

Val Leu cys Ser He Ser Leu Val Tyr Gin Phe Val Met Phe Ser 

200 205 210 

Leu Lys Thr 



<210> 12 

<211> 476 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> niisc_feat\ire 

<223> Incyte ID No; 70064803CD1 

Met°Ali^Gly Ser Asp Thr Ala Pro Phe Leu Ser Gin Ala Asp Asp 
15 10 15 

Pro Asp Asp Gly Pro Val Pro Gly Thr Pro Gly Leu Pro Gly Ser 

20 25 30 

Thr Gly Asn Pro Lys Ser Glu Glu Pro Glu Val Pro Asp Gin Glu 

35 40 45 

Gly Leu Gin Arg He Thr Gly Leu Ser Pro Gly Arg Ser Ala Leu 

50 55 60 

He Val Ala Val Leu Cys Tyr He Asn Leu Leu Asn Tyr Met Asp 

65 70 75 

Arg Phe Thr Val Ala Gly Val Leu Pro Asp He Glu Gin Phe Phe 

80 85 90 

Asn He Gly Asp Ser Ser Ser Gly Leu He Gin Thr Val Phe He 

95 100 105 

Ser Ser Tyr Met Val Leu Ala Pro Val Phe Gly Tyr Leu Gly Asp 

110 , „^ tr 

Arg Tyr Asn Arg Lys Tyr Leu Met Cys Gly Gly He Ala Phe Trp 

125 130 135 

Ser Leu Val Thr Leu Gly Ser Ser Phe He Pro Gly Glu His Phe 

140 145 150 

Trp Leu Leu Leu Leu Thr Arg Gly Leu Val Gly Val Gly Glu Ala 

155 160 165 

Ser Tyr Ser Thr He Ala Pro Thr Leu He Ala Asp Leu Phe Val 
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170 175 180 

Ala Asp Gin Arg Ser Arg Met Leu Ser He Phe Tyr Phe Ala He 

185 190 195 

Pro Val Gly Ser Gly Leu Gly Tyr He Ala Gly Ser Lys Val Lys 

200 205 210 

Asp Met Ala Gly Asp Trp His Trp Ala Leu Arg Val Thr Pro Gly 

215 220 225 

Leu Gly Val Val Ala Val Leu Leu Leu Phe Leu Val Val Arg Glu 

230 235 240 

Pro Pro Arg Gly Ala Val Glu Arg His Ser Asp Leu Pro Pro Leu 

245 250 255 

Asn Pro Thr Ser Trp Trp Ala Asp Leu Arg Ala Leu Ala Arg Asn 

260 265 270 

Leu He Phe Gly Leu He Thr Cys Leu Thr Gly Val Leu Gly Val 

275 280 285 

Gly Leu Gly Val Glu He Ser Arg Arg Leu Arg His Ser Asn Pro 

290 295 300 

Arg Ala Asp Pro Leu Val Cys Ala Thr Gly Leu Leu Gly Ser Ala 

305 310 315 

Pro Phe Leu Phe Leu Ser Leu Ala Cys Ala Arg Gly Ser He Val 

320 325 330 

Ala Thr Tyr He Phe He Phe He Gly Glu Thr Leu Leu Ser Met 

335 340 345 

Asn Trp Ala He Val Ala Asp He Leu Leu Tyr Val Val He Pro 

350 355 360 

Thr Arg Arg Ser Thr Ala Glu Ala Phe Gin He Val Leu Ser His 

365 370 375 

Leu Leu Gly Asp Ala Gly Ser Pro Tyr Leu He Gly Leu He Ser 

380 385 390 

Asp Arg Leu Arg Arg Asn Trp Pro Pro Ser Phe Leu Ser Glu Phe 

395 400 405 

Arg Ala Leu Gin Phe Ser Leu Met Leu Cys Ala Phe Val Gly Ala 

410 415 420 

Leu Gly Gly Ala Ala Phe Leu Gly Thr Ala He Phe He Glu Ala 

425 430 435 

Asp Arg Arg Arg Ala Gin Leu His Val Gin Gly Leu Leu His Glu 
^ 440 445 450 

Ala Gly Ser Thr Asp Asp Arg He Val Val Pro Gin Arg Gly Arg 

455 460 465 

Ser Thr Arg Val Pro Val Ala Ser Val Leu He 

470 475 

<210> 13 

<211> 246 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 70356768CD1 

<400> 13 . n . 

Met Leu His Ala Leu Leu Arg Ser Arg Met He Gin Gly Arg He 
15 10 15 

Leu Leu Leu Thr He Cys Ala Ala Gly He Gly Gly Thr Phe Gin 

20 25 30 

Phe Gly Tyr Asn Leu Ser He lie Asn Ala Pro Thr Leu His He 

35 40 45 

Gin Glu Phe Thr Asn Glu Thr Trp Gin Ala Arg Thr Gly Glu Pro 

50 55 60 

Leu Pro Asp His Leu Val Leu Leu Met Trp Ser Leu He Val Ser 

65 70 75 

Leu Tyr Pro Leu Gly Gly Leu Phe Gly Ala Leu Leu Ala Gly Pro 

80 85 90 

Leu Ala He Thr Leu Gly Arg Lys Lys Ser Leu Leu Val Asn Asn 

95 100 105 

He Phe Val Val Ser Ala Ala He Leu Phe Gly Phe Ser Arg Lys 
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110 115 120 

Ala Gly Ser Phe GIu Met He Met Leu Gly Arg Leu Leu Val Gly 

125 130 135 , 

Val Asn Ala Gly Val Ser Met Asn He Gin Pro Met Tyr Leu Gly 

140 145 150 

Glu Ser Ala Pro Lys Glu Leu Arg Gly Ala Val Ala Met Ser Ser 

155 160 165 

Ala He Phe Thr Ala Leu Gly He Val Met Gly Gin Val Val Gly 

170 175 180 . 

Leu Arg Glu Leu Leu Gly Gly Pro Gin Ala Trp Pro Leu Leu Leu 

185 190 195 

Ala Ser Cys Leu Val Pro Gly Ala Leu Gin Leu Ala Ser Leu Pro 

200 205 210 j 

Leu Leu Pro Glu Ser Pro Arg Tyr Leu Leu He Asp Cys Gly Asp 

215 220 225 , 

a?hr Glu Ala Cys Leu Ala Glu Thr Gly Ser Arg Leu Ser Arg Leu 

230 235 240 

Glu Cys cys Gly Cys Ser 

245 

<210> 14 
<211> 436 
<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5674114CD1 

<400> 14 .... 
Met Gly Leu Ala Arg Ala Leu Arg Arg Leu Ser Gly Ala Leu Asp 
1.5 10 15 

Ser Gly Asp Ser Arg Ala Gly Asp Glu Glu Glu Ala Gly Pro Gly 

20 25 30 

Leu Cys Arg Asn Gly Trp Ala Pro Ala Pro Val Gin Ser Pro Val 
35 40 • 45 

Gly Arg Arg Arg Gly Arg Phe Val Lys Lys Asp Gly His Cys Asn 
50 55 60 

Val Arg Phe Val Asn Leu Gly Gly Gin Gly Ala Arg Tyr Leu Ser 
65 70 75 

Asp Leu Phe Thr Thr Cys Val Asp Val Arg Trp Arg Trp Met Cys 
80 85 90 

Leu Leu Phe Ser Cys Ser Phe Leu Ala Ser Trp Leu Leu Phe Gly 
95 100 105 

Leu Ala Phe Trp Leu He Ala Ser Leu His Gly Asp Leu Ala Ala 

110 115 120 

Pro Pro Pro Pro Ala Pro Cys Phe Ser His Val Ala Ser Phe Leu 

125 130 135 

Ala Ala Phe Leu Phe Ala Leu Glu Thr Gin Thr Ser He Gly Tyr 

140 145 150 

Glv Val Arg Ser Val Thr Glu Glu Cys Pro Ala Ala Val Ala Ala 

155 160 165 

Val Val Leu Gin Cys He Ala Gly Cys Val Leu Asp Ala Phe Val 

170 175 180 

Val Gly Ala Val Met Ala Lys Met Ala Lys Pro Lys Lys Arg Asn 

185 190 195 

Glu Thr Leu Val Phe Ser Glu Asn Ala Val Val Ala Leu Arg Asp 

200 205 210 

His Arg Leu Cys Leu Met Trp Arg Val Gly Asn Leu Arg Arg Ser 

215 220 225 

His Leu Val Glu Ala His Val Arg Ala Gin Leu Leu Gin Pro Arg 

230 235 240 

Val Thr Pro Glu Gly Glu Tyr He Pro Leu Asp His Gin Asp Val 

245 250 255 

Asp Val Gly Phe Asp Gly Gly Thr Asp Arg He Phe Leu Val Ser 
^ 260 265 270 

Pro He Thr He Val His Glu He Asp Ser Ala Ser Pro Leu Tyr 
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275 280 285 

Glu Leu Gly Arg Ala Glu Leu Ala Arg Ala Asp Phe Glu Leu Val 

290 295 -^^^ 

val lie Leu Glu Gly Met Val Glu Ala Thr Ala Met -Ehr Thr Gin 

305 

CVS Arg Ser Ser Tyr Leu Pro Gly Glu Leu Leu Trp Gly His Arg 

320 ^25 
Phe Glu Pro Val Leu Phe Gin Arg Gly Ser Gin Tyr Glu Val Asp 

335 340 -^^^ 

Tyr Arg His Phe His Arg Thr Tyr Glu Val Pro Gly Thr Pro Val 

350 355 -^Y^ 

cys Ser Ala Lys Glu Leu Asp Glu. Arg Ala Glu Gin Ala Ser His 

ser Leu Lys Ser Ser Phe Pro Gly Ser Leu Thr Ala Phe Cys Tyr 

380 385 
Glu Asn Glu Leu Ala Leu Ser Cys Cys Gin Glu Glu Asp Glu Asp 

Asp Glu Thr Glu Glu Gly Asn Gly Val Glu Thr Glu Asp Gly Ala 

Ala Ser Pro Arg ill Leu Thr Pro Thr Leu Ala Leu Thr Leu Pro 
425 430 '^^^ 

Pro 



<210> 15 
<211> 453 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 1254635CD1 

Mt2°Lei\ys Met Val Leu Thr Glu Asn Pro Asn Gin Glu He Ala 

Thr ser Leu Glu Phi Leu Leu Leu Gin Asn Ser Pro Gly Ser Leu 

20 * 25 

Arg Ala Gin Gin Arg Met Ser Tyr Tyr Gly Ser Ser Tyr His lie 

3 35 40 

He Asn Ala Asp Ala Lys Tyr Pro Gly Tyr Pro Pro Glu His lie 

He Ala Glu Lys A^g Arg Ala Arg Arg Arg Leu Leu His Lys ;^p 

65 '0 
Gly ser Cys Asn Val Tyr Phe Lys His lie Phe Gly Glu Trp Gly 

80 ^5 
ser Tyr Val Val Asp He Phe Thr -Thr Leu Val Asp Thr Lys Trp 
95 100 -^^^ 

Arg His Met Phe Val He Phe Ser Leu Ser Tsn: He Leu Ser Trp 
110 -LIS ^ "^^^ 

Leu He Phe Gly Ser Val Phe Trp Leu He Ala Phe His Has Gly 

125 130 
Asp Leu Leu Asn Asp Pro Asp He Thr Pro Cys Val Asp Asn Val 
140 

His ser Phe Thr Gly Ala Phe Leu Phe Ser Leu Glu Thr Gin Thr 

155 160 
Thr He Gly Tyr Gly Tyr Arg Cys Val Thr Glu Glu Cys Ser Val 

170 

Ala Val Leu Met Val He Leu Gin Ser He Leu Ser Cys He He 

185 190 J-^^ 

Asn Thr Phe He He Gly Ala Ala Leu Ala Lys Met Ala Thr Ala 

200 205 
Arg Lys Arg Ala Gin Thr He Arg Phe Ser Tyr Phe Ala Leu He 

215 220 ■^^^ 

Gly Met Arg Asp Gly Lys Leu Cys Leu Met Trp Arg He Gly Asp 

230 235 
Phe Arg Pro Asn His Val Val Glu Gly Thr Val Arg Ala Gin Leu 
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245 250 255 

Leu Arg Tyr Thr Glu Asp Ser Glu Gly Arg Met Thr Met Ala Phe 

260 265 270 ^ 

Lys Asp Leu Lys Leu Val Asn Asp Gin He He Leu Val Thr Pro 

275 280 285 

val Thr He Val His Glu He Asp His Glu Ser Pro Leu Tyr Ala 

290 295 300 

Leu Asp Arg Lys Ala Val Ala Lys Asp Asn Phe Glu He Leu Val, 

305 310 315 

Thr Phe He Tyr Thr Gly Asp Ser Thr Gly Thr Ser His Gin Ser 

320 325 330 

Arg Ser Ser Tyr Val Pro Arg Glu He lieu Trp Gly His Arg Phe 

335 340 345 | 

Asn Asp Val Leu Glu Val Lys Arg Lys Tyr Tyr Lys Val Asn Cys 

350 355 360 , 

Leu Gin Phe Glu Gly Ser Val Glu Val Tyr Ala Pro Phe Cys Ser 

365 370 375 

Ala Lys Gin Leu Asp Trp Lys Asp Gin Gin Leu His He Glu Lys 

380 385 390 

Ala Pro Pro Val Arg Glu Ser Cys Thr Ser Asp Thr Lys Ala Arg 

395 400 405 

Arg Arg Ser Phe Ser Ala Val Ala He Val Ser Ser Cys Glu Asn 
^ ^ 410 415 420 

Pro Glu Glu Thr Thr Thr Ser Ala Thr His Glu Tyr Arg Glu Thr 

425 430 435 

Pro Tvr Gin Lys Ala Leu Leu Thr Leu Asn Arg He Ser Val Glu 

440 445 450 

Ser Gin Met 



<210> 16 

<211> 299 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 1670595CD1 

Met^Ala^Ser Glu Ser Ser Pro Leu Leu Ala Tyr Arg Leu Leu Gly 
15 10 15 

Glu Glu Gly Val Ala Leu Pro Ala Asn Gly Ala Gly Gly Pro Gly 
20 25 30 

Gly Ala Ser Ala Arg Lys Leu Ser Thr Phe Leu Gly Val Val Val 
35 40 45 

Pro Thr Val Leu Ser Met Phe Ser He Val Val Phe Leu Arg He 
50 55 60 

Gly Phe val Val Gly His Ala Gly Leu Leu Gin Ala Leu Ala Met 
65 70 .75 

Leu Leu Val Ala Tyr Phe He Leu Ala Leu Thr Val Leu Ser Val 
80 85 90 

cys Ala He Ala Thr Asn Gly Ala Val Gin Gly Gly Gly Ala Tyr 
95 100 105 

cys He Leu Gin His Arg Trp Thr Gly Met Pro Gin Gly Pro Val 
110 115 120 

Gly Ser Gly Ser Cys Pro Arg Ala Thr Ala Trp Asn Leu Leu Tyr 
125 130 135 

Gly Ser Leu Leu Leu Gly Leu Val Gly Gly Val Cys Thr Leu Gly 
140 145 150 

Ala Gly Leu Tyr Ala Arg Ala Ser Phe Leu Thr Phe Leu Leu Val 
155 160 165 

Ser Gly Ser Leu Ala Ser Val Leu He Ser Phe Val Ala Val Gly 
170 175 ' 180 

Pro Arg Asp He Arg Leu Thr Pro Arg Pro Gly Pro Asn Gly Ser 
185 190 195 

Ser Leu Pro Pro Arg Phe Gly His Phe Thr Gly Phe Asn Ser Ser 
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200 205 210 

Thr Leu Lys Asp Asn Leu Gly Ala Gly Tyr Ala Glu Asp Tyr Thr 

215 220 225 

Thr Gly Ala Val Met Asn Phe Ala Ser Val Phe Ala Val Leu Phe 

230 235 240 

Asn Gly Arg His His Gly Trp Gly Gin His Val Arg Gly Ala Glu 

245 250 255 

Gly Pro Gin Pro Gly Asp Pro Ser Gly His Asp Arg Arg Arg Arg 

260 265 270 

Leu His Leu Leu Arg Leu Cys Pro Ala Phe Leu Ser Leu Gin Pro 

275 280 285 

Pro Phe Thr Gly Ala Leu Met Leu Gly Ala Arg Pro Pro Leu 

290 295 

<210> 17 

<211> 606 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1859560CD1 

<400> 17 

Met Pro ser Ser Val Thr Ala Leu Gly Gin Ala Arg Ser Ser Gly 
15 10 15 

Pro Gly Met Ala Pro Ser Ala Cys Cys Cys Ser Pro Ala Ala Leu 
20 25 30 

Gin Arg Arg Leu Pro lie Leu Ala Trp Leu Pro Ser Tyr Ser Leu 
35 40 45 

Gin Trp Leu Lys Met Asp Phe Val Ala Gly Leu Ser Val Gly Leu 
50 55 60 

Thr Ala He Pro Gin Ala Leu Ala Tyr Ala Glu Val Ala Gly Leu 
65 70 75 

Pro Pro Gin Tyr Gly Leu Tyr Ser Ala Phe Met Gly Cys Phe Val 
80 85 90 

Tyr Phe Phe Leu Gly Thr Ser Arg Asp Val Thr Leu Gly Pro Thr 
95 100 105 

Ala He Met Ser Leu Leu Val Ser Phe Tyr Thr Phe His Glu Pro 

110 115 120 

Ala Tyr Ala Val Leu Leu Ala Phe Leu Ser Gly Cys He Gin Leu 

125 130 135 

Ala Met Gly Val Leu Arg Leu Gly Phe Leu Leu Asp Phe He Ser 

140 145 150 

Tyr Pro Val He Lys Gly Phe Thr Ser Ala Ala Ala Val Thr He 

155 160 165 

Gly Phe Gly Gin He Lys Asn Leu Leu Gly Leu Gin Asn He Pro 

170 175 180 

Arg Pro Phe Phe Leu Gin Val Tyr His Thr Phe Leu Arg He Ala 

185 190 195 

Glu Thr Arg Val Gly Asp Ala Val Leu Gly Leu Val Cys Met Leu 

200 205 210 

Leu Leu Leu Val Leu Lys Leu Met Arg Asp His Val Pro Pro Val 

215 220 225 

His Pro Glu Met Pro Pro Gly Val Arg Leu Ser Arg Gly Leu Val 

230 235 240 

Trp Ala Ala Thr Thr Ala Arg Asn Ala Leu Val Val Ser Phe Ala 

245 250 255 

Ala Leu Val Ala Tyr Ser Phe Glu Val Thr Gly Tyr Gin Pro Phe 

260 265 270 

He Leu Thr Gly Glu Thr Ala Glu Gly Leu Pro Pro Val Arg He 

275 280 285 

Pro Pro Phe Ser Val Thr Thr Ala Asn Gly Thr He Ser Phe Thr 

290 295 300 

Glu Met Val Gin Asp Met Gly Ala Gly Leu Ala Val Val Pro Leu 

305 310 315 

Met Gly Leu Leu Glu Ser He Ala Val Ala Lys Ala Phe Ala Ser 
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320 325 330 

Gin Asn Asn Tyx Arg lie Asp Ala Asn Gin Glu Leu Leu Ala lie 

335 340 , 345 

Gly Leu Thr Asn Met Leu Gly Ser Leu Val ser Ser Tyr Pro Val 

350 355 360 

■Thr Gly Ser Phe Gly Arg Thr Ala Val Asn Ala Gin Ser Gly Vkl 

365 370 375 

Cys Thr Pro Ala Gly Gly Leu Val Thr Gly Val Leu Val Leu Leu 

380 385 390 

Ser Leu Asp Tyr Leu Thr Ser Leu Phe Tyr Tyr lie Pro Lys Ser 

395 400 405 

Ala Leu Ala Ala Val He He Met Ala Val Ala Pro Leu Phe Asp 

410 415 420 

Thr Lys He Phe Arg Thr Leu Trp Arg Val Lys Arg Leu Asp Leu 

425 430 435 

Leu Pro Leu Cys Val Thr Phe Leu Leu Cys Phe' Trp Glu Val Gin 

440 445 450 

Tyr Gly He Leu Ala Gly Ala Leu Val Ser Leu Leu Met Leu Leu 

455 460 465 

His Ser Ala Ala Arg Pro Glu Thr Lys Val Ser Glu Gly Pro Val 

470 475 480 

Leu Val Leu Gin Pro Ala Ser Gly Leu Ser Phe Pro Ala Met Glu 

485 490 495 

Ala Leu Arg Glu Glu He Leu Ser Arg Ala Leu Glu Val Ser Pro 

500 505 510 

Pro T^g Cys Leu Val Leu Glu Cys Thr His Val Cys Ser He Asp 

515 520 525 

Tyr Thr Val Val Leu Gly Leu Gly Glu Leu Leu Gin Asp Phe Gin 

530 535 540 

Lys Gin Gly Val Ala Leu Ala Phe Val Gly Leu Gin Val Pro Val 

545 550 555 

Leu Arg Val Leu Leu Ser Ala Asp Leu Lys Gly Phe Gin Tyr Phe 

560 565 570 

Ser Thr Leu Glu Glu Ala Glu Lys His Leu Arg Gin Glu Pro Gly 

575 580 585 

Thr Gin Pro Tyr Asn He Arg Glu Asp Ser He Leu Asp Gin Lys 

590 595 600 

Val Ala Leu Leu Lys Ala 

605 



<210> 18 

<211> 324 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 5530164CD1 

<400> 18 



Met 


Ser 


Val 


Glu 


Asp Gly 


Gly 


Met 


Pro 


Gly Leu Gly Arg 


Pro 


Arg 


1 








5 










10 








15 


Gin 


Ala 


Arg 


Trp 


Thr 


Leu 


Met 


Leu 


Leu 


Leu 


Ser 


Thr Ala 


Met 


Tyr 






20 










25 








30 


Gly 


Ala 


His 


Ala 


Pro 


Leu 


Leu 


Ala 


Leu 


Cys 


His 


Val Asp 


Gly Arg 








35 










40 








45 


Val 


Pro 


Phe 


Arg 


Pro 


Ser 


Ser 


Ala 


Val 


Leu 


Leu 


Thr Glu 


Leu 


Thr 








50 










55 








60 


Lys 


Leu 


Leu 


Leu 


Cys 


Ala 


Phe 


Ser 


Leu 


Leu 


Val 


Gly Trp 


Gin 


Ala 








65 










70 








75 


Trp 


Pro 


Gin 


Gly 


Pro 


Pro 


Pro 


Trp Arg 


Gin 


Ala 


Ala Pro 


Phe 


Ala 






80 










85 








90 


Leu 


Ser 


Ala 


Leu 


Leu 


Tyr 


Gly 


Ala 


Asn 


Asn 


Asn 


Leu Val 


He 


Tyr 










95 






100 








105 


Leu 


Gin 


Arg 


Tyr 


Met 


Asp 


Pro 


Ser 


Thr 


Tyr 


Gin 


Val Leu 


Ser 


Asn 








110 








115 








120 


Leu 


Lys 


He 


Gly 


Ser 


Thr 


Ala 


Val 


Leu 


Tyr 


Cys 


Leu cys 


Leu Arg 
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125 130 135 

His Arg Leu Ser Val Arg Gin Gly Leu Ala Leu Leu Leu Leu Met 
140 145 150 

Ala Ala Gly Ala Cys Tyr Ala Ala Gly Gly Leu Gin Val Pro Gly 

155 160 165 

Asn Thr Leu Pro Ser Pro Pro Pro Ala Ala Ala Ala Ser Pro Met 

170 175 180 

Pro Leu His He Thr Pro Leu Gly Leu Leu Leu Leu He Leu Tyr 

185 190 195 

CVS Leu He Ser Gly Leu Ser Ser Val Tyr Thr Glu Leu Leu Met 
^ 200 205 210 

Lys Arg Gin Arg Leu Pro Leu Ala Leu Gin Asn Leu Phe Leu Tyr 

215 220 225 

Thr Phe Gly Val Leu Leu Asn Leu Gly Leu His Ala Gly Gly Gly 

230 235 ' 240 

Ser Gly Pro Gly Leu Leu Glu Gly Phe Ser Gly Trp Ala Ala Leu 

245 250 255 

Val Val Leu Ser Gin Ala Leu Asn Gly Leu Leu Met Ser Ala Val 

260 265 270 

Met Lvs His Gly Ser Ser He Thr Arg Leu Phe Val Val Ser Cys 

275 280 285 

Ser Leu Val Val Asn Ala Val Leu Ser Ala Val Leu Leu Arg Leu 

290 295 300 

Gin Leu Thr Ala Ala Phe Phe Leu Ala Thr Leu Leu He. Gly Leu 

305 310 315 

Ala Met Arg Leu Tyr Tyr Gly Ser Arg 

320 

<210> 19 
<211> 445 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc^feature 

<223> Incyte ID No: 139115CD1 

<400> 19 

Met Thr Leu Thr Gly Pro Leu Thr Thr Gin Tyr Val Tyr Arg Arg 
I ' 5 10 15 

He Trp Glu Glu Thr Gly Asn Tyr Thr Phe Ser Ser Asp Ser Asn 
20 25 30 

He Ser Glu Cys Glu Lys Asn Lys Ser Ser Pro He Phe Ala Phe 
35 40 45 

Gin Glu Glu Val Gin Lys Lys Val Ser Arg Phe Asn Leu Gin Met 
50 55 60 

Asp He Ser Gly Leu He Pro Gly Leu Val Ser Thr Phe He Leu 
65 70 75 

Leu Ser He Ser Asp His Tyr Gly Arg Lys Phe Pro Met He Leu 
80 85 90 

Ser Ser Val Gly Ala Leu Ala Thr Ser Val Trp Leu Cys Leu Leu 
95 100 105 

Cvs Tvr Phe Ala Phe Pro Phe Gin Leu Leu He Ala Ser Thr Phe 
^ ^ 110 115 120 

He Gly Ala Phe Cys Gly Asn Tyr Thr Thr Phe Trp Gly Ala Cys 
125 130 135 

Phe Ala Tyr He Val Asp Gin Cys Lys Glu His Lys Gin Lys Thr 
140 145 150 

He Arg He Ala He He Asp Phe Leu Leu Gly Leu Val Thr Gly 
155 160 165 

Leu Thr Gly Leu Ser Ser Gly Tyr Phe He Arg Glu Leu Gly Phe 
170 175 180 

Glu Trp ser Phe Leu He He Ala Val Ser Leu Ala Val Asn Leu 
185 190 195 

He Tyr He Leu Phe Phe Leu Gly Asp Pro Val Lys Glu Cys Ser 
200 205 210 

Ser Gin Asn Val Thr Met Ser Cys Ser Glu Gly Phe Lys Asn Leu 
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215 220 225 

Phe Tyr Arg Thr Tyr Met Leu Phe Lys Asn Ala Ser Oly Lys Arg 

Arg Phe Leu Leu %l Leu Leu Leu Phe Thr Val He Thr Tyr Phe 

245 

Phe val val He Gly He Ala Pro He Phe lie Leu Tyr Glu Leu 

260 zb3 
ASP ser pro Leu Cys Trp Asn Glu Val Phe He Gly Tyr Gly Ser 

275 2o\j ^ 

Ala Leu Gly Ser Ala Ser Phe Leu Thr Ser Phe Leu Gly He Trp 

290 -? -3 

Leu Phe ser Tyr Cys Met Glu Asp He His Met Ala Phe He Gly ■ , 
305 ' 
He Phe Thr Thr Met Thr Gly Met Ala Met Thr Ala Phe Ala Ser | 

_ Pro Phe Leu Phe 

33i - ' ' 340 345 



Thr Thr Leu Met Me? Phe Leu, Ala Arg Vai Pro Phe Leu Phe Thr 
335 T 
lie val Pro Phe Ser Val Leu Arg Ser Met Leu Ser Lys Val Val 



Arg ser Thr Glu gS Gly Thr Leu Phe Ala Cys He Ala Phe Leu 



355 



Glu Thr Leu Gly lly Val Thr Ala Val Ser Thr Phe Asn Gly He 



370 



380 —v.,^ 
Tyr ser Ala Thr Val Ala Trp -Tyr Pro Gly Phe Thr Phe Leu Leu 



385 



ser Ala Gly Leu Leu Leu Leu Pro Ala He Ser Leu Cys Val Val 

410 41b ^ 

Lys cys Thr Ser Trp Asn Glu Gly Ser Tyr Glu Leu Leu lie Gin 



400 



425 430 
Glu 
440 



Glu Glu Ser ser Glu Asp Ala Ser Asp Arg 



<210> 20 

<211> 337 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> incyte ID No: 1702940CD1 



Phe Gin ASP Gin Val Ser Arg Glu Asn Leu Leu Gin Leu Leu T^r 
20 

ASP ASP Glu Ala Trp Asn Gly Phe Val Ala Ala Ala Glu Leu Pro 
Arg ASP Glu Ala A^p Glu Leu Arg Lys Ala Leu Asn Lys Leu Ala 



iTlsl\ro Glu ser Ser He Phe He Glu Asp Tyr Leu Lys Tyr 
5 
;al 
20 
rrp 
35 
l^sp 

ser His Met Val ull Lys Asp Lys Asn Arg His Asp Lys Asp Gin 



55 
l^rg 

Gin His Arg Gin T^ Phe Leu Lys Glu Phe Pro Arg Leu Lys Arg 
80 85 
He Arg Lys Leu 

95 



Glu Leu Glu ASP His He Arg Lys Leu Arg Ala Leu Ala Glu Glu 

Ser 

110 115 120 

Asn ser Val Gly Thr Thr Ser Gly He Leu Thr Leu Leu Gly Leu 



val Glu Gin val His Arg Gly Thr -nir He Ala Asn Val Val Ser 



125 

Gly Leu Ala Pro Phe Thr Glu Gly He Ser Phe Val Leu Leu Asp 



Thr Gly Met Gly HI Gly Ala Ala Ala Ala Val Ala Gly He Thr 

155 T 
cys ser Val Val Glu Leu Val Asn Lys Leu Arg Ala Arg Ala Gin 



145 



Ala Arg Asn Leu Sp Gin Ser Gly Thr A^n Val Ala Lys Val Met 



175 



21/49 



wo 01/46258 PCT/USOO/35095 

185 190 
Lys Glu Phe val Gly Gly Asn -Ehr Pro Asn Val Leu Thr Leu Val 

200 205 210 

Asp Asn Trp Tyr Gin Val Thr Gin Gly lie Gly Arg Asn He Arg 

215 220 225 

Ala He Arg Arg Ala Arg Ala Asn Pro Gin Leu Gly Ala Tyr Ala 

230 235 
Pro Pro Pro His Val lie Gly Arg He Ser Ala Glu Gly Gly Glu 

245 250 255 

Gin Val Glu Arg Val Val Glu Gly Pro Ala Gin Ala Met Ser Arg 

260 265 270 

Gly Thr Met He Val Gly Ala Ala Thr Gly Gly He Leu Leu Leu 

275 280 _ 285 

Leu Asp Val Val Ser Leu Ala Tyr Glu Ser Lys His Leu Leu Glu 

290 295 ' -^uo 

Glv Ala Lys Ser Glu Ser Ala Glu Glu Leu Lys Lys Arg Ala Gin 

305 310 315 

Glu Leu Glu Gly Lys Leu Asn Phe Leu Thr Lys He His Glu Met 

320 325 330 

Leu Gin Pro Gly Gin Asp Gin 

335 

? 

<210> 21 

<211> 273 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1703342CD1 

Mtt^Ali^Thr Trp Asp Glu Lys Ala Val Thx Arg Arg Ala Lys Val 

1 5 
Ala Pro Ala Glu Arg Met Ser Lys Phe Leu Arg His Phe Thr Val 
20 25 30 

val Gly Asp Asp Tyr His Ala Trp Asn lie Asn Tyr Lys Lys Trp 

Glu Asn Glu Glu Glu Glu Glu Glu Glu Glu Gin Pro Pro Pro Thr 

50 55 oO 

Pro Val Ser Gly Glu Glu Gly Arg Ala Ala Ala Pro Asp Val Ala 
65 70 75 

Pro Ala Pro Gly Pro Ala Pro Arg Ala Pro Leu Asp Phe Arg Gly 

80 85 
Met Leu Arg Lys Leu Phe Ser Ser His Arg Phe Gin Val He lie 
95 100 103 

He Cys Leu Val Val Leu Asp Ala Leu Leu Val Leu Ala Glu Leu 

110 115 120 

He Leu Asp Leu Lys He He Gin Pro Asp Lys Asn Asn Tyr Ala 

125 1^0 ^'^^ 

Ala Met Val Phe His Tyr Met Ser He Thr^Ile Leu Val Phe Phe 

140 145 1=>0 

Met Met Glu He He Phe Lys Leu Phe Val Phe Arg Leu Glu Phe 

155 160 lo5 

Phe His His Lys Phe Glu He Leu Asp Ala Val Val Val Val Val 

170 175 1^ 

Ser Phe He Leu Asp He Val Leu Leu Phe Gin Glu His Gin Phe 

185 190 
Glu Ala Leu Gly Leu Leu He Leu Leu Arg Leu Trp Arg Val Ala 

200 205 210 

Arg He He Asn Gly He He He Ser Val Lys Thr Arg Ser Glu 

215 220 225 

Arg Gin Leu Leu Arg Leu Lys Gin Met Asn Val Gin Leu Ala Ala 

230 235 240 

Lys He Gin His Leu Glu Phe Ser Cys Ser Glu Lys Glu Gin Glu 

245 250 255 

He Glu Arg Leu Asn Lys Leu Leu Arg Gin His Gly Leu Leu Gly 
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260 265 270 

GlU Val Asn 



<210> 22 
<211> 710 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1727529CD1 

<400> 22 

Met Gly Gly Lys Gin Arg Asp Glu Asp Asp Glu Ala Tyx Gly Lys 
1 5 10 ' 15 

Pro Val Lys Tyr Asp Pro Ser Phe Arg Gly Pro lie Lys Asn Arg 

20 25 30 

Ser Cys Thr Asp Val lie Cys Cys Val Leu Phe Leu Leu Phe lie 
35 40 45 

Leu Gly Tyr He Val Val Gly He Val Ala Trp Leu Tyr Gly Asp 
50 55 60 

Pro Arg Gin Val Leu Tyr Pro Arg Asn Ser Thr Gly Ala Tyr Cys 
65 70 75 

Gly Met Gly Glu Asn Lys Asp Lys Pro Tyr Leu Leu Tyr Phe Asn 
80 85 90 

He Phe Ser Cys He Leu Ser Ser Asn He He Ser Val Ala Glu 
95 100 105 

Asn Gly Leu Gin Cys Pro Thr Pro Gin Val Cys Val Ser Ser Cys 

110 115 120 

Pro Glu Asp Pro Trp Thr Val Gly Lys Asn Glu Phe Ser Gin Thr 

125 130 135 

Val Gly Glu Val Phe Tyr Thr Lys Asn Arg Asn Phe cys Leu Pro 

140 145 150 

Gly Val Pro Trp Asn Met Thr Val He Thr Ser Leu Gin Gin Glu 

155 160 165 

Leu Cys Pro Ser Phe Leu Leu Pro Ser Ala Pro Ala Leu Gly Arg 

170 175 180 

Cys Phe Pro Trp Thr Asn He Thr Pro Pro Ala Leu Pro Gly He 

185 190 195 

Thr Asn Asp Thr Thr He Gin Gin Gly He Ser Gly Leu He Asp 

200 205 210 

Ser Leu Asn Ala Arg Asp He Ser Val Lys He Phe Glu Asp Phe 
• 215 220 225 

Ala Gin Ser Trp Tyr Trp He Leu Val Ala Leu Gly Val Ala Leu 

230 235 240 

Val Leu Ser Leu Leu Phe He Leu Leu Leu Arg Leu Val Ala Gly 

245 250 255 

Pro Leu Val Leu Val Leu He Leu Gly Val Leu Gly Val Leu Ala 

260 265 270 

Tyr Gly He Tyr Tyr Cys Trp Glu Glu Tyr Arg Val Leu Arg Asp 

275 280 285 

Lys Gly Ala Ser He Ser Gin Leu Gly Phe Thr Thr Asn Leu Ser 

290 295 300 

Ala Tyr Gin Ser Val Gin Glu Thr Trp Leu Ala Ala Leu He Val 

305 310 315 

Leu Ala Val Leu Glu Ala He Leu Leu Leu Val Leu He Phe Leu 

320 325 330 

Arg Gin Arg He Arg He Ala He Ala Leu Leu Lys Glu Ala Ser 

335 340 345 

Lys Ala Val Gly Gin Met Met Ser Thr Met Phe Tyr Pro Leu Val 

350 355 360 

Thr Phe Val Leu Leu Leu He Cys lie Ala Tyr Trp Ala Met Thr 

365 370 375 

Ala Leu Tyr Leu Ala Thr Ser Gly Gin Pro Gin Tyr Val Leu Trp 

380 385 390 

Ala Ser Asn He Ser Ser Pro Gly Cys Glu Lys Val Pro He Asn 
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395 400 405 

Thr Ser Cys Asn Pro Thr Ala His Leu Val Asn Ser Ser Cys Pro 

410 415 420 

Gly Leu Met Cys Val Phe Gin Gly Tyr Ser Ser Lys Gly Leu He 

425 430 435 

Gin Arg Ser Val Phe Asn Leu Gin He Tyr Gly Val Leu Gly Leu 

440 445 450 

Phe Trp Thr Leu Asn Trp Val Leu Ala Leu Gly Gin Cys Val Leu 

455 460 465 

Ala Gly Ala Phe Ala Ser Phe Tyr Trp Ala Phe His Lys Pro Gin 

470 475 480 

Asp He Pro Thr Phe Pro Leu He Ser Ala Phe He Arg Thr Leu 

485 490 495 

Arg Tyr His Thr Gly Ser Leu Ala Phe Gly Ala Leu He Leu Thr 

500 505 ' 510 

Leu Val Gin He Ala Arg Val He Leu Glu Tyr He Asp His Lys 

515 520 525 

Leu Arg Gly Val Gin Asn Pro Val Ala Arg Cys He Met Cys Cys 

530 535 540 

Phe Lys Cys Cys Leu Trp Cys Leu Glu Lys Phe He Lys Phe Leu 

545 550 555 

Asn Arg Asn Ala Tyr He Met He Ala He Tyr Gly Lys Asn Phe 

560 565 570 . 

Cys Val Ser Ala Lys Asn Ala Phe Met Leu Leu Met Arg Asn He 

575 580 585 

Val Arg Val Val Val Leu Asp Lys Val Thr Asp Leu Leu Leu Phe 

590 595 600 

Phe Gly Lys Leu Leu Val Val Gly Gly Val Gly Val Leu Ser Phe 

605 610 615 

Phe Phe Phe Ser Gly Arg He Pro Gly Leu Gly Lys Asp Phe Lys 

620 625 630 

Ser Pro His Leu Asn Tyr Tyr Trp Leu Pro He Met Thr Ser He 

635 640 645 

Leu Gly Ala Tyr Val He Ala Ser Gly Phe Phe Ser Val Phe Gly 

650 655 660 

Met Cys Val Asp Thr Leu Phe Leu Cys Phe Leu Glu Asp Leu Glu 

665 670 675 

Arg Asn Asn Gly Ser Leu Asp Arg Pro Tyr Tyr Met Ser Lys Ser 

680 685 690 

Leu Leu Lys He Leu Gly Lys Lys Asn Glu Ala Pro Pro Asp Asn 

695 700 705 

Lys Lys Arg Lys Lys 

710 

<210> 23 

<211> 476 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_f eature 

<223> Incyte ID No: 22893 33CD1 

<400> 23 

Glu Gin Asn Phe Asp Gly Thr Ser Asp Glu Glu His Glu Gin Glu 

15 10 15 

Leu Leu Pro Val Gin Lys His Tyr Gin Leu Asp Asp Gin Glu Gly 

20 25 30 

He Ser Phe Val Gin Thr Leu Met His Leu Leu Lys Gly Asn He 

35 40 45 

Glv Thr Gly Leu Leu Gly Leu Pro Leu Ala He Lys Asn Ala Gly 

^ 50 55 60 

He Val Leu Gly Pro He Ser Leu Val Phe He Gly He He Ser 

65 70 75 

Val His Cys Met His He Leu Val Arg Cys Ser His Phe Leu Cys 

80 85 90 

Leu Arg Phe Lys Lys Ser Thr Leu Gly Tyr Ser Asp Thr Val Ser 
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95 100 105 

Phe Ala Met Glu Val Ser Pro Trp Ser Cys Leu Gin Lys Gin Ala 

110 115 120 , 

Ala Trp Gly Arg Ser Val Val Asp Phe Phe Leu Val He Thr Gin 
125 130 135 

Leu Gly Phe Cys Ser Val Tyr He Val Phe Leu Ala Glu Asn Val 
140 145 1=>" 

Lys Gin Val His Glu Gly Phe Leu Glu Ser Lys Val Phe lie Ser 
155 160 165 

Asn ser Thr Asn Ser Ser Asn Pro cys Glu Arg Arg Ser Val Asp 
170 175 180 

Leu Arg He Tyr Met Leu Cys Phe Leu Pro Phe He He Leu Leu 

185 190 195 I 

Val Phe He Arg Glu Leu Lys Asn Leu Phe Val Leu Ser Phe Leu i 

200 205 , 210 

Ala Asn val Ser Met Ala Val Ser Leu Val He He Tyx Gin Tyr 
215 220 225 

Val Val Arg Asn Met Pro Asp Pro His Asn Leu Pro He Val Ala 
230 235 240 

Gly Trp Lys Lys Tyr Pro Leu Phe Phe Gly Thr Ala Val Phe Ala 
245 250 -^bt) 

Phe Glu Gly He Gly Val Val Leu Pro Leu Glu Asn Gin Met Lys 

260 265 
Glu Ser Lys Arg Phe Pro Gin Ala Leu Asn He Gly Met Gly He 
275 280 285 

val Thr Thr Leu Tyr Val Thr Leu Ala Thr Leu Gly Tyr Met Cys 
290 295 300 

Phe His Asp Glu He Lys Gly Ser He Thr Leu Asn Leu Pro Gin 
305 310 315 

Asp Val Trp Leu Tyr Gin Ser Val Lys He Leu Tyr Ser Phe Gly 
320 325 330 

He Phe Val Thr Tyr Ser He Gin Phe Tyr Val Pro Ala Glu He 
335 340 345 

He He Pro Gly He Thr Ser Lys Phe His Thr Lys Trp Lys Gin 
350 355 360 

He cys Glu Phe Gly He Arg Ser Phe Leu Val Ser He Thr Cys 
365 370 375 

Ala Gly Ala He Leu He Pro Arg Leu Asp He Val He Ser Phe 
380 385 f^yu 

Val Gly Ala Val Ser Ser Ser Thr Leu Ala Leu He Leu Pro Pro 
395 400 405 

Leu Val Glu He Leu Thr Phe Ser Lys Glu His Tyr Asn He Trp 

410 415 
Met Val Leu Lys Asn He Ser He Ala Phe Thr Gly Val Val Gly 
425 430 435 

Phe Leu Leu Gly Thr Tyr He Thr Val Glu Glu He He Tyr Pro 

440 445 
Thr Pro Lys Val Val Ala Gly Thr Pro Gin Ser Pro Phe Leu Asn 
455 460 465 

Leu Asn Ser Thr cys Leu Thr Ser Gly Leu Lys 
470 475 

<210> 24 

<211> 237 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> inisc_feature 

<223> Incyte ID No: 2720354CD1 

M^t^Gly^Leu Thr Phe He Asn Ala Leu Val Phe Gly Val Gin Gly 
1 5 10 15 

Asn Thr Leu Arg Ala Leu Gly His Asp Ser Pro Leu Asn Gin Phe 
20 25 30 

Leu Ala Gly Ala Ala Ala Gly Ala He Gin cys Val He Cys Cys 
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35 40 fS 

Pro Met Glu Leu Ala Lys Thr Arg Leu Sin Leu 61n Asp Ala Gly 

50 5^ T 

Pro Ala Arg Thr Tyr Lys Gly Ser Leu Asp Cyis Leu Ala Gin lie 

Tyr Gly His Glu Gly Leu Arg Gly Val Asn Arg Gly Met Val Ser- 

80 ® 
Thr Leu Leu Arg Glu Thr Pro Ser Phe Gly Val Tyr Phe Leu Thr 

Tyr Asp Ala Leu Thr Arg Ala Leu Gly Cys Glu Pro Gly Asp Arg 

Leu Leu Val Pro Lys Leu Leu Leu Ala Gly Gly Thr Ser Gly lie 
125 

val ser Trp Leu Ser Thr Tyr Pro Val Asp Val Val Lys Ser Arg 

Leu Gin Ala Asp Gly Leu Arg Gly Ala Pro Arg Tyr Arg Gly lie 
155 T 
Leu ASP cys Val His Gin Ser Tyr Arg Ala Glu Gly Trp Arg Val 

Phe Thr Arg Gly ilu Ala Ser Thr Leu Leu Arg Ala Phe Pro Val 

185 n 

Asn Ala Ala Thr Phe Ala Thr Val Thr Val Val Leu Thr Tyr Ala 

200 205 ^-j-^ 

Arg Gly Glu Glu Ala Gly Pro Glu Gly Glu Ala Val Pro Ala Ala 

Pro Ala Gly Pro Ala Leu Ala Gin Pro Ser Ser Leu 
230 235 

<210> 25 
<211> 345 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> iaisc_f eature 

<223> Incyte ID No: 3038193CD1 

Mef irg^eu Leu Glu Arg Met Arg Lys Asp Trp Phe Met Val Gly 



lie val Leu Ala 111 Ala Gly Ala Lys Leu Glu Pro Ser He Gly 

val Asn Gly Gly P?o Leu Lys Pro Glu lie Thr Val Ser Tyr lie 

35 40 
Ala val Ala Thr He Phe Phe Asn Ser Gly Leu Ser Leu Lys l^r 

Glu Glu Leu Thr sll Ala Leu Val His Leu Lys Leu His Leu Phe 
65 

He Gin He Phe Thr Leu Ala Phe Phe Pro Ala Thr He Trp Leu 

80 °^ 
Phe Leu Gin Leu Leu Ser He Thr Pro lie Asn Glu Trp Leu Leu 

95 100 
Lys Gly Leu Gin Thr Val Gly Cys Met Pro Pro Pro Val Ser Ser 

110 iij 
Ala Val lie Leu Thr Lys Ala Val Gly Gly Asn Glu Gly He Val 

125 -^"^^ 
He Thr Pro Leu Leu Leu Leu Leu Phe Leu Gly Ser Ser Ser Ser 

140 -T 
Val Pro Phe Thr Ser He Phe Ser Gin Leu Phe Met Thr Val Val 

155 160 
Val Pro Leu He He Gly Gin He Val Arg Arg Tyr He Lys Asp 

Trp Leu Glu Arg lJs Lys Pro Pro Phe Gly Ala He Ser Ser Ser 

Val Leu Leu Met He He Tyr Thr Thr Phe Cys Asp Thr Phe Ser 
200 205 ^-^^ 

Asn Pro Asn He Asp Leu Asp Lys Phe Ser Leu Val Leu He Leu 
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215 220 225 

Phe lie lie Phe Ser He 61n Leu Ser Phe Met Leu Leu Thr Phe 

230 235 -^^^ I 

lie Phe Ser Thr Arg Asn Asn Ser Gly Phe Oto Pro Ala Asp Thr 

245 250 
Val Ala He He Phe Cys Ser Thr His Lys Ser Leu Thr Leu Gly 

He Pro Met Leu lys He Val Phe Ala Gly Tyr Glu His Leu Ser 

Leu He Ser Val Pro Leu Leu He Tyr His Pro Ala Gin He Leu 

290 295 300 

Leu Gly ser Val Leu Val Pro Thr He Lys Ser Trp Met Val Ser 

305 310 I 

Arg Gin Lys Lys Leu Leu Gin Thr Arg Gly Pro Leu Ala Asn Leu 

320 325 330 , 

Asn Asn Pro Glu Gly Leu Glu Tyr Leu Ser He Lys Phe Gly His 

335 340 -^^^ 



<210> 26 
<211> 521 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3460979CD1 

Met^Ala\la Leu Ala Pro Val Gly Ser Pro Ala Ser Arg Gly Pro 
1 5 10 '^^ 

Arg Leu Ala Ala Gly Leu Arg Leu Leu Pro Met Leu Gly Leu Leu 

20 25 
Gin Leu Leu Ala Glu Pro Gly Leu Gly Arg Val His His Leu Ala 

35 40 
Leu Lys Asp Asp Val Arg His Lys Val His Leu Asn Thr Phe Gly 

50 55 
Phe Phe Lys Asp Gly Tyr Met Val Val Asn Val Ser Ser Leu Ser 

Leu Asn Glu Pro Glu Asp Lys Asp Val Thr He Gly Phe Ser Leu 

80 85 
Asp Arg Thr Lys Asn Asp Gly Phe Ser Ser Tyr Leu Asp Glu Asp 

95 100 
val Asn Tyr Cys He Leu Lys Lys Gin Ser Val Ser Val Thr Leu 

110 115 J-'i" 

Leu lie Leu Asp He Ser Arg Ser Glu Val Arg Val Lys Ser Pro 

125 1*^0 -^^-^ 

Pro Glu Ala Gly Thr Gin Leu Pro Lys He He Phe Ser Arg Asp 

140 145 
Glu Lys Val Leu Gly Gin Ser Gin Glu Pro Asn Val Asn Pro Ala 

155 160 
ser Ala Gly Asn Gin Thr Gin Lys Thr Gin Asp Gly Gly Lys Ser 

170 175 -LOW 

Lys Arg Ser Thr Val Asp Ser Lys Ala Met Gly Glu Lys Ser Phe 

185 190 
Ser Val His Asn Asn Gly Gly Ala Val Ser Phe Gin Phe Phe Phe 

200 205 210 

Asn He ser Thr Asp Asp Gin Glu Gly Leu Tyr Ser Leu Tyr Phe 

215 220 
His Lys cys Leu Gly Lys Glu Leu Pro Ser Asp Lys Phe Thr Phe 

230 235 ^'^^ 

Ser Leu Asp He Glu He Thr Glu Lys Asn Pro Asp Ser Tyr Leu 

245 250 
Ser Ala Gly Glu He Pro Leu Pro Lys Leu Tyr He Ser Met Ala 

260 265 ^'^ 

Phe Phe Phe Phe Leu Ser Gly Thr He Trp He His He Leu Arg 

275 280 28b 
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Lys Arg Arg Asn Asp Val Phe Lys He His Trp Leu Met Ala Ala 

290 295 300 

Leu Pro Phe Thr Lys Ser Leu Ser Leu Val Phe His Ala He Asp 

305 310 315 

Tyr His Tyr He Ser Ser Gin Gly Phe Pro He Glu Gly Trp Ala 

320 325 330 

Val Val Tyr Tyr He Thr His Leu Leu Lys Gly Ala Leu Leu Phe 

335 340 345 

He Thr He Ala Leu He Gly Thr Gly Trp Ala Phe He Lys His 

350 355 360 

He Leu Ser Asp Lys Asp Lys Lys He Phe Met He Val He Pro 

365 370 375 

Leu Gin Val Leu Ala Asn Val Ala Tyr He He He Glu Ser Thr 

380 385 390 

Glu Glu Gly Thr Thr Glu Tyr Gly Leu Trp Lys Asp Ser Leu Phe 

395 400 405 

Leu Val Asp Leu Leu Cys Cys Gly Ala He Leu Phe Pro Val Val 

410 415 420 

Trp Ser He Arg His Leu Gin Glu Ala Ser Ala Thr Asp Gly Lys 

425 430 435 

Ala Ala He Asn Leu Ala Lys Leu Lys Leu Phe Arg His Tyr Tyr 

440 445 450 

Val Leu He Val Cys Tyr He Tyr Phe Thr Arg He He Ala Phe 

455 460 465 

Leu Leu Lys Leu Ala Val Pro Phe Gin Trp Lys Trp Leu Tyr Gin 

470 475 480 

Leu Leu Asp Glu Thr Ala Thr Leu Val Phe Phe Val Leu Thr Gly 

485 490 495 

Tyr Lys Phe Arg Pro Ala Ser Asp Asn Pro Tyr Leu Gin Leu Ser 

500 505 510 

Gin Glu Glu Glu Asp Leu Glu Met Glu Ser Val 

515 520 

<210> 27 
<211> 555 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7472200CD1 

<400> 27 , 

•.£^11 v«l Twr PViei Pro Pro Ser Lys Leu Gin Gin Gin Gin 

10 15 

1 Ala Gin Gin Leu Ala Gin Ser 
25 30 

2 Gly Lys Arg Thr Thr He His 
40 45 

a Lys Ala Ser Arg Trp Glu Arg 
55 60 
1 Ser Ala Phe Leu Gly Ala Val 
70 75 
a Arg Tyr Asn Ala Ala His Phe 
85 90 
g Phe Pro Val Tyr Arg He Pro 
100 105 
Q Arg Asn Arg Leu Asn Tirp Gin 
115 120 
g Phe Leu Ala Asn Gly Ser Asn 
130 135 
u Leu He Val Gly Thr Tyr Asp 
145 150 
n Ser Phe Glu Arg Leu Arg Asn 
160 165 
r Val Asn Phe Ser Gin Val Val 
175 180 



Met 


Thr 


Leu 


Val 


Tyr 


Phe 


Pro 


1 








5 






Gin 


Pro 


Ser 


Arg 


Ser 
20 


Ser 


Arg 


Ser 


Trp Gin 


Leu 


Ala 


Leu 


Arg 










35 






Gly 


Leu Asp 


Arg 


Leu 


Leu 


Ser 








50 






Phe 


yal 


Trp 


Leu 


Cys 


Thr 


Phe 








65 






Tyr 


Val 


Cys 


Leu 


He 


Leu 


Ser 








80 






Gin 


Thr 


Val 


Val 


Asp 
95 


Ser 


Thr 


Phe 


Pro 


Val 


He 


Thr 
110 


He 


Cys 


Arg 


Leu 


Ala 


Glu 


Ala 


Lys 


Ser 








125 






Ser 


Ala 


Gin 


Gin 


Glu 
140 


Leu 


Phe 


Asp 


Ala 


Tyr 


Phe Gly His 


Phe 










155 






Gin 


Pro 


Thr 


Glu 


Leu 
170 


Leu 


Asn 
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Asp 


PhG 


Met 


Thr 


Trp 


Arg 


Cys 


Asn Glu 


Leu 


Leu 


Ala 


Glu 


Cys 


Leu 








185 




190 










195 


Trp 


Arg 


His 


His 


Ala 


Tyr 


Asp 


Cys Cys 


Glu 


lie Arg 


Ser 


Lys 


Arg 










200 








205 










210 


Arg 


Ser 


Lys 


Asn 


Gly 


Leu 


Cys 


Trp Ala 


Phe 


Asn 


Ser 


Leu 


Glu 


Thr 










215 






220 










225 


Glu 


Glu 


Gly 


Arg 


Arg 


Met 


Gin 


Leu Leu 


Asp 


Pro 


Met 


Trp 


Pro 


Trp 








230 








235 










240 


Arg 


Thr 


Gly 


Ser 


Ala 


Gly 


Pro 


Met Ser 


Ala 


Leu 


Ser 


Val 


Arg 


Val 






245 






250 










255 


Leu 


He 


Gin 


Pro 


Ala 


Lys 


His 


Trp Pro 


Gly 


His Arg 


Glu 


Thr 


Asn 










260 




265 










270 


Ala 


Met 


Lys 


Gly 


He 


Asp 


Val 


Met Val 


Thr 


Glu 


Pro 


Phe 


Val 


Trp 






275 








280 










285 


His 


Asn 


Asn 


Pro 


Phe 
290 


Phe 


Val 


Ala Ala 


Asn 
295 


Thr 


Glu 


Thr 


Thr 


Met 
300 


Glu 


He 


Glu 


Pro 


Val 


He 


Tyr 


Phe Tyr 


Asp 


Asn Asp 


Thr 


Arg 


Gly 










305 






310 










315 


Val 


Arg 


Ser 


Asp 


Gin 


Arg 


Gin 


Cys Val 


Phe 


Asp Asp 


Glu 


His 


Asn 










320 








325 










330 


Ser 


Lys 


Asp 


Phe 


Lys 


Ser 


Leu 


Gin Gly 


Tyr 


Val 


Tyr 


Met 


He 


Glu 








335 








340 










345 


Asn 


Cys 


Gin 


Ser 


Glu 


Cys 


His 


Gin Glu 


Tyr 


Leu Val 


Arg 


Tyr 


Cys 










350 






355 










360 


Asn 


Cys 


Thr 


Met 


Asp 


Leu 


Leu 


Phe Pro 


Pro 


Asp 


Leu 


Leu 


He 


Tyr 








365 








370 










375 


Ser 


His 


Asn 


Pro 


Gly 
380 


Glu 


Lys 


Glu Phe 


Val 
385 


Arg 


Asn 


Gin 


Phe 


Gin 
390 


Gly 


Met 


Ser 


Cys 


Lys 
395 


Cys 


Phe 


Arg Asn 


Cys 
400 


Tyr 


Ser 


Leu 


Asn 


Tyr 
405 


He 


Ser 


Asp 


Val 


Arg 


Pro 


Ala 


Phe Leu 


Pro 


Pro 


Asp 


Val 


Tyr 


Ala 








410 








415 










420 


Asn 


Asn 


Ser 


Tyr 


Val 


Asp 


Leu 


Asp Val 


His 


Phe 


Arg 


Phe 


Glu 


Thr 








425 




430 










435 


He 


Met 


Val 


Tyr 


Arg 


Thr 


Ser 


Leu Val 


Phe 


Gly 


Trp 


Val 


Asp 


Leu 








440 








445 










450 


Met 


Val 


Ser 


Phe 


Gly 


Gly 


He 


Ala Gly 


Leu 


Phe 


Leu 


Gly 


Cys 


Ser 










455 






460 










465 


Leu 


He 


Ser 


Gly 


Met 


Glu 


Leu 


Ala Tyr 


Phe 


Leu 


Cys 


He 


Glu 


Val 








470 








475 










480 


Pro 


Ala 


Phe 


Gly 


Leu 


Asp 


Gly 


Leu Arg 


Arg 


Arg 


Trp 


Lys 


Ala 


Arg 








485 








490 










495 


Arg 


Gin 


Met 


Asp 


Leu 


Gly 


Val 


Thr Val 


Pro 


Thr 


Pro 


Thr 


Leu 


Asn 






500 






505 










510 


Phe 


Gin 


Gin 


Thr 


Thr 
515 


Pro 


Ser 


Gin Leu 


Met 
520 


Glu 


Asn 


Tyr 


He 


Met 
525 


Gin 


Leu 


Lys 


Ala 


Glu 


Lys 


Ala 


Gin Gin 


Gin 


Lys 


Ala 


Asn 


Phe 


Gin 








530 






535 










540 


Asn 


Trp 


His 


Arg 


He 


Thr 


Phe 


Ala Gin 


Lys 


His 


Val 


He 


Gly 


Lys 






545 








550 










555 



<210> 28 

<211> 2080 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1416107CB1 

<400> 28 

ggcggttcag gcgccagagc tggccgatcg gcgttggccg ccgacatgac gcccgaggac 60 

ccagaggaaa cccagccgct tctggggcct cctggcggca gcgcgccccg cggccgccgc 120 

gtcttcctcg ccgccttcgc cgctgccctg ggcccactca gcttcggctt cgcgctcggc 180 

tacagctccc cggccatccc tagcctgcag cgcgccgcgc ccccggcccc gcgcctggac 240 

gacgccgccg cctcctggtt cggggctgtc gtgaccctgg gtgccgcggc ggggggagtg 300 
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ctgggcggct ggctggtgga ccgcgccggg cgcaagctga gcctcttgct gtgctccgtg 360 
cccttcgtgg ccggctttgc cgtcatcacc gcggcccagg acgtgtggat gctgctgggg 420 
ggccgcctcc tcaccggcct ggcctgcggt gttgcctccc tagtggcccc ggtctacatc 480 
tccgaaatcg cctacccagc agtccggggg ttgctcggct cctgtgtgca gctaatggtc 540 
gtcgtcggca tcctcctggc ctacctggca ggctgggtgc tggagtggcg ctggctggct 600 
gtgctgggct gcgtgccccc ctccctcatg ctgcttctca tgtgcttcat gcccgagacc 660 
ccgcgcttcc tgctgactca gcacaggcgc caggaggcca tggccgccct gcggttcctg 720 
tggggctccg agcagggctg ggaagacccc cccatcgggg ctgagcagag ctttcacctg 780 
gccctgctgc ggcagcccgg catctacaag cccttcatca tcggcgtctc cctgatggcc 840 
ttccagcagc tgtcgggggt caacgccgtc atgttctatg cagagaccat ctttgaagag 900 
gccaagttca aggacagcag cctggcctcg gtcgtcgtgg gtgtcatcca ggtgctgttc 9-60 
acagctgtgg cggctctcat catggacaga gcagggcgga ggctgctcct ggtcttgtca 1020 
ggtgtggtca tggtgttcag cacgagtgcc ttcggcgcct acttcaagct gacccagggt 1080 
ggccctggca actcctcgca cgtggccatc tcggcgcctg tctctgcaca gcctgttgat 1140 
gccagcgtgg ggctggcctg gctggccgtg ggcagcatgt gcctcttcat cgccggcttt 1200 
gcggtgggct gggggcccat cccctggctc ctcatgtcag agatcttccc tctgcatgtc 1260 
aagggcgtgg cgacaggcat ctgcgtcctc accaactggc tcatggcctt tctcgtgacc 1320 
aaggagttca gcagcctcat ggaggtcctc aggccctatg gagccttctg gcttgcctcc 1380 
gctttctgca tcttcagtgt ccttttcact ttgttctgtg tccctgaaac taaaggaaag 1440 
actctggaac aaatcacagc ccattttgag gggcgatgac agccactcac taggggatgg 1500 
agcaagcctg tgactccaag ctgggcccaa gcccagagcc cctgcctgcc ccaggggagc 1560 
cagaatccag ccccttggag ccttggtctg cagggtccct ccttcctgtc atgctccctc 1620 
cagcccatga cccggggcta ggaggctcac tgcctcctgt tccagctcct gctgctgctc 1680 
tgaggactca ggaacacctt cgagctttgc agacctgcgg tcagccctcc atgcgcaaga 1740 
ctaaagcagc ggaagaggag gtgggcctct aggatctttg tcttctggct ggaggtgctt 1800 
ttggaggttg ggtgctgggc attcagtcgc tcctctcacg cggctgcctt atcgggaagg 1860 
aaatttgttt gccaaataaa gactgacaca gaaaatcagg tcagtgtctc tgggctttgt 1920 
gcaagctcag tttgaaaagg gtttattccc atcactgccc aggacaccct gtggctttac 1980 
ttgctcatgg tcagccaagc ttacccttca cactgagaag tcatttctgg ctacttcctt 2040 
gggctcagtt ccctgggtca tcagccatca aatcttgttg 2080 

<210> 29 

<211> 2128 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 1682513CB1 

<400> 29* ^ 
ctggccctag ggagctgccc ctgtcgctgg ctgcctgcac caaccagccc cacattgtca 6 0 
actacctgac ggagaacccc cacaagaagg cggacatgcg gcgccaggac tcgcgaggca 120 
acacagtgct gcatgcgctg gtggccattg ctgacaacac ccgtgagaac accaagtttg 180 
ttaccaagat gtacgacctg ctgctgctca agtgtgcccg cctcttcccc gacagcaacc 240 
tggaggccgt gctcaacaac gacggcctct cgcccctcat gatggctgcc aagacgggca 300 
agattgggaa ccgccacgag atgctggctg tggagcccat caatgaactg ctgcgggaca 3 60 
agtggcgcaa gttcggggcc gtctccttct acatcaacgt ggtctcctac ctgtgtgcca 420 
tggtcatctt cactctcacc gcctactacc agccgctgga gggcacaccg ccgtaccctt 480 
accgcaccac ggtggactac ctgcggctgg ctggcgaggt cattacgctc ttcactgggg 540 
tcctgttctt cttcaccaac atcaaagact tgttcatgaa gaaatgccct ggagtgaatt 600 
ctctcttcat tgatggctcc ttccagctgc tctacttcat ctactctgtc ctggtgatcg 660 
tctcagcagc cctctacctg gcagggatcg aggcctacct ggccgtgatg gtctttgccc 720 
tggtcctggg ctggatgaat gccctttact tcacccgtgg gctgaagctg acggggacct 780 
atagcatcat gatccagaag attctcttca aggacctttt ccgattcctg ctcgtctact 840 
tgctcttcat gatcggctac gcttcagccc tggtctccct cctgaacccg tgtgccaaca 900 
tgaaggtgtg caatggggac cagaccaact gcacagtgcc cacttacccc tcgtgccgtg 960 
acagcgagac cttcagcacc ttcctcctgg acctgtttaa gctgaccatc ggcatgggcg 1020 
acctggagat gctgagcagc accaagtacc ccgtggtctt catcatcctg ctggtgacct 1080 
acatcatcct cacctttgtg ctgctcctca acatgctcat tgccctcatg ggcgagacag 1140 
tgggccaggt ctccaaggag agcaagcaca tctggaagct gcagtgggcc accaccatcc 1200 
tggacattga gcgctccttc cccgtattcc tgaggaagtc cttccgctct ggggagatgg 1260 
tcaccgtggg caagagctcg gacggcactc ctgaccgcag gtggtgcttc agggtggatg 1320 
aggtgaactg gtctcactgg aaccagaact tgggcatcat caacgaggac ccgggcaaga 1380 
atgagaccta ccagtattat ggcttctcgc ataccgtggg ccgcctccgc agggatcgct 1440 
ggtcctcggt ggtaccccgc gtggtggaac tgaacaagaa ctcgaacccg gacgaggtgg 1500 
tggtgcctct ggacagcacg gggaaccccc gctgcgatgg ccaccagcag ggttaccccc 1560 
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gc«gtgg.g g,ctg.tgac g=c==»=tct «g.=tgca gjc^go|cc jactt^totg 1620 
rtg-SSc ^SSgS SIS^cS ^Itlllil lltl ' 

IS! iiii iiiii ipS 11 II liii ii 
iS Si! iiii iiiii iiiii ^ IH 

ataaatgttt attcattgaa aaaaaaaa 

<210> 30 j , 

<211> 2825 I 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc^feature 

<223> Incyte ID No: 2446438CB1 

iiii «i S3 iis iiiii iiii I 

liii! r= Si iiiiiS 1 iii ! i 
i ss? «s i |s s i - 

S= 1=1^ « s g s » 

tcSSgtl SlSSgt ^S^S^lgS SSS? g~ 
McScg .ctoglaata c.cagagggc tcoacaggta W^gf^J StlS??gg 8" 
gtgotgaicc ttaiggacgg ggtoaatgcc tgo.ttctgc =»=taotgoj 9«t||»=5?| 

SllgS" SISSS KSS Ucjg^ -0 

ESS???? IITX S?SS?= SS?ggl SSS i-o 

?^gr^=i s» =i 

gcactggtga ccagcatgta tg^tgggctc ^t^^^^S^^g g|9=^ | ggccgccaag 1320 

ctttcccgaa agttcaccga gtggtgctat ^g^cctgccc 959^9 tcattgcaag 1500 

Ig^SSS? SSc^gSt g« S S gc.gg|.i i||0 

«g? SK» i i s Mb "IS 

gtgctgggct ggctgaacct gctttactat acacgtggct: ^^^^a^pj:^^ oatctactta 2040 

I^SI't^ LSccccL t^=»=«t^ .=agtg-9c f^SS?? ft?S?c 22" 
gagggcaacg gggcccagta caggggtatc ctggaagccc cctcgguyu ^ 2280 

ds^s?i? s=?e s« s« I I i ? -^s 

?=« Scgl i |SS g Mi 

gacccgtcag gggcaggtgt ccctcgaact ctcgagaacc "s^^^^^^ ^ ^cc 2700 
aaggaggatg aggatggtgc ctctgaggaa aactatgtgc ccgtccagct 2760 
aaitgatggc ccagatgcag caggaggcoa gaggacagag cagaggatct ttccaaccac 
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atctgctggc tctggggtcc cagtgaattc tggtggcaaa tatatatttt cactaaaaaa 2820 
aaaaa 



<210> 31 

<211> 1718 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: 2817822CB1 



<400> 31 

gcctcggtgt tcccacctag gggcgggcag 
tgcatgtggc agggctgcgc agtggagcgg 
ctgccccagt ccccccggcc caggcggcca 
gggggtgagt tccacaccac caccctgggt 
gcagagatgt tctctagctt agccaaggcc 
gaccgcccca gcacctattt cagacccatc 
acacagcaca tccctgaagt gtaccgtgag 
aagctgctgg aggacatgcc acagatcttt 
ctgcaagtgc cgggctacag cgagaacctg 
gccataacag cacggaagtc cagcgtgctt 
gcatattatt cagaggtcct gtgttttctg 
aagtttgggc cctggaaggc ggtcctagac 
gacattaagg cccaggggta caaggtattc 
agaaacgaat tccattttaa catttattca 
gcagagactg ttatgaattc tggcgtggct 
tttcttttaa tttcacaaac atcaggcaat 
aatattgatc actcgtttaa ggactttcca 
ctagcaactt gcagctactt ccttttcaaa 
aagtccaata acaagaccaa gtaagaatgt 
tgacaacagg aacatacaag atactgtgaa 
acatagcccc agcttggggt ccaatccatc 
tgctttcctg atcccctttg cgagatgctg 
ctctagagtg gaggttttca aagtgcatca 
caaatcctca ggccccacct cagacctact 
ttctgattta ccaaaccctc caagtgattt 
gaaaagaatt gctacctctt gtatggaggt 
ttcctttccc agagctcctc atggaatcaa 
tcttactcag tttttttcct ctgtcctacg 
cactccatca ataaaccact tgcacgagaa 



ccaggggcac ttccgctggc ccaagtgatc 60 
ccagtgggca ggatgacgag ccagacccct 120 
acgatgtcta ctgttgtgga gctgaacgtc 180 
accctgagga agtttccggg ctcaaagctg 240 
tccacggacg cggagggccg cttcttcatc 300 
ctggactacc tgcgcactgg gcaagtgccc 360 
gctcagttct acgaaatcaa gcctttggtc 420 
ggtgagcagg tgtctcggaa gcagtttttg 480 
gagctcatgg tgcgcctggc acgtgcagaa 540 
gtgtgcctgg tggaaactga ggagcaggat 600 
caggataaga agatgttcaa gtctgttgtc 660 
aacagcgacc tcatgcactg cctggagatg 720 
tccaagttct acctgacgta ccccaccaaa 780 
ttcaccttca cctggtggtg atcctcagga 840 
tatgaaatta aaagttgcca tcaaagccat 900 
ttccagggtt ggtctagagt cttgccacta 960. 
ctccattgca actgatgcca ctatatttgc 1020 
gcctcatgta tctcccagac. ccttctcttg 1080 
ttcaacaatg cgttggcaag agatgtgaga 1140 
tctagatgtt ctgacctaaa gatgtagtct 1200 
tgtccctggc atgtgccttc atgtagtagg 1260 
tgggtgctaa cacctcagag ctgtcctctt 1320 
tcagcattac ctgtgaactt gctggaaata 1380 
gaatcagaat ctctgggggt tggcacagca 1440 
tgatgtattc taattttgag accatctcta 1500 
acaaaagact gacctcttae atcaaggaac 1*560 
gctgaagtca gtcttcttct gagagcacat 1620 
ctgcttccct cactcccctt ctcctaagag 1680 
aaaaaaaa 1718 



<210> 32 

<211> 2000 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 4009329CB1 



<400> 32 

gacgaatttg aaaccagggg gtgtcctgtt 
ctccagttgg aggggttcgg gagaaccata 
ggccaccgga gccgccagct gtttggaact 
agggccaggc cccgtggggg cagatggccg 
gtgtgctttg tgtgctgcta atggcggaga 
gagctcacat tagcccccag tttccagctt 
gccgcaaggt gtgtggcctg aatgtctctg 
actgccacag tgatgggggg tacctggact 
ccagcctcct ccctctggct gtcactctct 
ttctgggagt caccgcagcc aagtttttct 
tgaagctctc ccacaacgtg gcaggcgtca 
acatcttcag tgccctggtg gccttctctg 
cactgtttgg cgctggcgtg ctggttacca 
accccttcat ggctgcctcc aggcccttct 



tgaacttggt gccagataga gtaactcgga 60 
gaagaggaag ggccgtgtct tccgtggaca 120 
gagctactgc agaaagggaa gtggagagta 180 
gcagaaggct gaatctgcgc tgggcactga 240 
cagtgtctgg gactaggggc tcgtctacag 3 00 
caggtgtgaa ccagaccccc gtggtagact 3 60 
accgctgtga cttcatccgg accaaccctg 420 
acctggaagg catcttctgc cacttccctc 480 
acgtttcctg gctgctctac ctgtttctga 540 
gccccaactt gtcggccatt tctaccacac 600 
ccttcctggc atttgggaat ggtgcacctg 660 
acccgcacac agccggcctg gcccttgggg 720 
cagtggtggc cggaggcatt accatcctac 780 
tcagggacat cgttttctac atggtggctg 840 
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tgttcctgac cttcctcatg ctcttccgtg 
acctgggctt gtatgtgttc tatgtggtca 
ggcaacggag aggatctctg ttctgcccca 
ccgaggagga ccgggtatct tctaatacca 
cgctgttctt ctaccaggag accacggctc 
attacatgaa gtggagaagg aaatcagcat 
ctgtggagtt cctgctgctc ctcacagtcc 
actggaaacg gcccctcaac tgtctgcafcc 
ccctgcagtc ggggacctat ggtgtctatg 
tggtggtgat cgcaggcaca gccttggctt 
agccccccag gcttcactgg ctctttgctt 
tcaacgcggc cgccacagag gtggtgaaca 
tgagcaacac tgtgctgggg ctcacgctgc 
tctcggattt cacactggct cgccagggct 
gcggcatcat cttcaacatc ctcgtgggtg 
gaagccacac agaagtgaag ctggagccag 
ccctggggct cagcctcgtc ttctccctgg 
gcagagtcta tggcttctgc ctgctcctct 
tcattgaatt tggagtgatt cacctgaaaa 
cctcactgca ggcaggagcc 

<210> 33 

<211> 2216 

<212> DNA 

<213> Homo sapiens 



gcagggtcac cctggcatgg gctctgggtt 900 
ctgtgattct ctgcacctgg atctaccaac 960 
tgccagttac tccagagatc ctctcagact 1020 
acagctatga ctacggtgat gagtaccggc 1080 
agatcctggt ccgggccctc aatcccctgg 1140 
actggaaagc cctcaaggtg ttcaagctgc 1200 
ccgtcgtgga cccggacaag gatgaccaga 1260 
tggttatcag ccccctggtt gtggtcctga 1320 
agataggcgg cctcgttccc gtctgggtcg 1380 
cagtgacctt ttttgccaca tctgacagcc 1440 
tcctgggctt tctgaccagc gccctgtgga 1500 
tcttgcggtc cctgggtgtg gtcttccggc 1560 
tggcctgggg gaacagcatt ggagatgcct 1620 | 
acccacggat ggcgttctcc gcctgctttg 1680 i 
tggggctggg ctgcctgctc cagatctccc 1740 j 
acggactgct ggtgtgggtc ctggcaggcg 1800; 
tctcagtccc attgcagtgc ttccagctca 1860 
tctacctgaa cttccttgtc gtggccctcc 1920 
gcatgtgact gaagccgctt agtgctgtgg 1980 



<220> 

<221> misc_feature 

<223> Incyte ID No: 6618083CB1 



<400> 33 . I- 

gaaaactctt cctgaaggag atgcagagga agattcgaac tggaggaaaa ccctaa^ta 60 
aacaataaca acaaaagttc aaaacctgaa aagtgaacca tgaagctcag taaaaaggac 120 
cgaggagaag atgaagaaag tgattcagcg aaaaagaaat tggactggtc ctgctcgctc 180 
ctcgtggcct ccctcgcggg cgccttcggc tcctocttcc tctacggcta caacctgtcg 240 
gtggtgaatg cccccacccc gtacatcaag gccttttaca atgagtcatg ggaaagaagg 300 
catggacgtc caatagaccc agacactctg actttgctct ggtctgtgac tgtgtccata 360 
ttcgccatcg gtggacttgt ggggacgtta attgtgaaga tgattggaaa ggttcttggg 420 
agglagcaci Itttgctggc caataatggg tttgcaattt ctgctgcatt gctgatggcc 480 
tictcgctcc aggcaggagc ctttgaaatg ctcatcgtgg gacgcttcat catgggcata 540 
gitggaggcg tcgccctcag tgtgctcccc atgtacctca gtgagatctc acccaaggag 600 
Itciltllct ctltggggca ggtgactgcc atctttatct gcattggcgt gttcactggg 660 
cagcttctgg gcctgcccga gctgctggga aaggagagta cctggcoata cctgtttgga 720 
gtgattgtgg tccctgccgt tgtccagctg ctgagccttc cctttctccc ggacagccca 780 
cgctacctgc tcttggagaa gcacaacgag gcaagagctg tgaaagcctt ccaaacgttc 840 
ttgggtaaag cagacgtttc ccaagaggta gaggaggtcc tggctgagag ccacgtgcag 900 
aggagcatcc gcctggtgtc cgtgctggag ctgctgagag ctccctacgt ccgctggcag 960 
gtggtcaccg tgattgtcac catggcctgc taccagctct gtggcctcaa tgcaatttgg 1020 
ttctatacca acagcatctt tggaaaagct gggatccctc tggcaaagat cccatacgtc 1080 
accttgagta cagggggcat cgagactttg gctgccgtct tctctggttt ggtcattgag 1140 
cacctgggac ggagacccct cctcattggt ggctttgggc tcatgggcct cttctttggg 1200 
accctcacca tcacgctgac cctgcaggac cacgccccct gggtccccta cctgagtatc 1260 
gtgggcattc tggccatcat cgcctctttc tgcagtgggc cagctgtttt cccagaagaa 1320 
acggtaaatg tcagcattgt atctgagtga aaagttgacc ttcttcccca cccatgcaca 1380 
caScaagcc agattggact catctgcata tctgcctgaa gttctttgct aaccaaaaat 1440 
cactaagctt agccttctct gttttttttt tcctaagccc tcccaagact ttttgcaatg 1500 
atcctgattc tgttcoaagt gtttgcaaot gtggctttot tttgactgta gaacatgctg 1560 
catttccagg gctttaaatg ctgggctccc catcagtgtc tatgggactc cctggaggga 1620 
aggccacctg cacctcccaa tcccagatca cctgtcagcc cctgccctcc gcttcctcaa 1680 
tccatcttca accccctgtg ttgacccagc acctgggcct tgctggctag caatgacttt 1740 
agccacaaga tggaccaggg tttagaagct tcatttaaac tcacattgac agtgtacagt 1800 
ttaaagcctc agggaactta cctgtctaag aaaagctgcc acttagacca tgagaccatc 1860 
ttgcatcttc ctaagtggac agggaagagc aagtccccag gggagccacc cgggaaagtg IS^o 
tggcaggaag atgctcagag ctgaatggca gagagactca tgggcctgct ctccatgatt 1980 
aaagaagagg gatggatctc ccaggagagg gccaggaggc cgcctgaggc agcttctgtg 2U4U 
aggaacaggt cgatgtaaga agacttgaca aggagttgaa attaggtgaa agcaaagaaa 2100 
gaaaacaaga gaggcagttt cctgctgcat attttatttg tgtgcataac cccaaggcag 2160 
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tggcagggaa gtctaataaa tgaggcaaaa taaaagagct tcacctttta aaaaaa 2216 

<210> 34 

<211> 1995 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7472002CB1 

<400> 34 

atgaccgaaa aaaccaatgg tgtgaagagc tccccagcca ataatcacaa ccatcatgca 60 
cctcctgcca tcaaggccaa tggcaaagat gaccacagga caagcagcag gccacactct 120 
gcagctgacg atgacacctc ctcagaactg cagaggctgg cagacgtgga tgccccacag 180 
cagggaagga gtggcttccg caggatagtt cgcctggtgg ggatcatcag agaatgggcc 240 
aacaagaatt tccgagagga ggaacctagg cctgactcat tcctcgagcg ttttcgtggg 3 00 
cctgaactcc agactgtgac cacacaggag ggggatggca aaggcgacaa ggatggcgag 360 
gacaaaggca ccaagaagaa atttgaacta tttgtcttgg acccagctgg ggattggtac 420 
tactgctggc tatttgtcat tgccatgccc gtcctttaca actggtgcct gctggtggcc 480 
agagcctgct tcagtgacct acagaaaggc tactacctgg tgtggctggt gctggattat 540 
gtctcagatg tggtctacat tgcggacctc ttcatccgat tgcgcacagg tttcctggag 600 
caggggctgc tggtcaaaga taccaagaaa ctgcgagaca actacatcca caccctgcag 660 
ttcaagctgg atgtggcttc catcatcccc actgacctga tctattttgc tgtggacatc 720 
cacagccctg aggtgcgctt caaccgcctg ctgcactttg cccgcatgtt tgagttcttt 780 
gaccggacag agacacgcac caactaccct aacatcttcc gcatcagcaa ccttgtcctc 840 
tacatcttgg tcatcatcca ctggaatgcc tgcatctatt atgccatctc caaatccata 900 
ggctttgggg tcgacacctg ggtttaccca aacatcactg accctgagta tggctacctg 960 
gctagggaat acatctattg cctttactgg tccacactga ctctcactac cattggggag 1020 
acaccacccc ctgtaaagga tgaggagtac ctatttgtca tctttgactt cctgattggc 1080 
gtcctcatcfc ttgccaccat cgtgggaaat gtgggctcca tgatctccaa catgaatgcc 1140 
acccgggcag agttccaggc taagatcgat gccgtgaaac actacatgca gttccgaaag 1200 
gtcagcaagg ggatggaagc caaggtcatt aggtggtttg actacttgtg gaccaataag 12 60 
aagacagtgg atgagcgaga ciattctcaag aatctgccag ccaagctcag ggctgagata 1320 
gccatcaatg tccacttgtc cacactcaag aaagtgcgca tcttccatga ttgtgaggct 1380 
ggcctgctgg tagagctggt actgaaactc cgtcctcagg tcttcagtcc tggggattac 1440 
atttgccgca aaggggacat cggcaaggag atgtacatca ttaaggaggg caaactggca 1500 
gtggtggctg atgatggtgt gactcagtat gctctgctgt cggctggaag ctgctttggc 1560 
gagatcagta tccttaacat taagggcagt aaaatgggca atcgacgcac agctaatatc 1620 
cgcagcctgg gctactcaga tctcttctgc ttgtccaagg atgatcttat ggaagctgtg 1680 
actgagtacc ctgatgccaa gaaagtccta gaagagaggg gtcgggagat cctcatgaag 1740 
gagggactgc tggatgagaa cgaagtggca accagcatgg aggtcgacgt gcaggagaag 1800 
ctagggcagc tggagaccaa catggaaacc ttgtacactc gctttggccg cctgctggct 18 60 
gagtacacgg gggcccagca gaagctcaag cagcgcatca cagttctgga aaccaagatg 1920 
aaacagaaca atgaagatga ctacctgtct gatgggatga acagccctga gctggctgct 1980 
gctgacgagc cataa 1995 

<210> 35 

<211> 988 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1812692CB1 

<400> 35 

cttgggtgaa agaaaatcct gcttgacaaa aaccgtcact taggaaaaga tgtcctttcg 60 
ggcagccagg ctcagcatga ggaacagaag gaatgacact ctggacagca cccggaccct 120 
gtactccagc gcgtctcgga gcacagactt gtcttacagt gaaagcgact tggtgaattt 180 
tattcaagca aattttaaga aacgagaatg tgtcttcttt accaaagatt ccaaggccac 240 
ggagaatgtg tgcaagtgtg gctatgccca gagccagcac atggaaggca cccagatcaa 300 
ccaaagtgag aaatggaact acaagaaaca caccaaggaa tttcctaccg acgcctttgg 360 
ggatattcag tttgagacac tggggaagaa agggaagtat atacgtctgt cctgcgacac 420 
ggacgcggaa atcctttacg agctgctgac ccagcactgg cacctgaaaa cacccaacct 480 
ggtcatttct gtgaccgggg gcgccaagaa cttcgccctg aagccgcgca tgcgcaagat 540 
cttcagccgg ctcatctaca tcgcgcagtc . caaaggtgct tggattctca cgggaggcac 600 
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ccattatggc 
ttcagaggag 
caccctcatc 
cacatggaag 
gccagccaga 
tgaaaacaag 
ttcattaaaa 



ctgatgaagt 
aatattgtgg 
aggaattgcg 
aaagaccatg 
tcatggggaa 
gatgacgtac 
tgtgttctca 



acatcgggga 
ccattggcat 
atgctgaggt 
gcatgggcct 
gtctgccttt 
ctaattaact 
gcaatctc 



ggtggtgaga 
agcagcttgg 
accggtggga 
gtggcctgaa 
caaggagtgc 
gctgggaaag 



gataacacca 
ggcatggtct 
caggaggagg 
ccctggggct 
ctttgggacc 
agttaacaat 



tcagcaggag 660 
ccaaccggga 720 
tctgctaggt 780 
ctgtgatgga 840 
ttaaaggaat 9 00 
gaatgttttg 960 
988 



<210> 36 
<211> 3179 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3232992CB1 



<400> 36 

gcggagcggc 

ggggcgcggg 

tcgggcagcg 

gcagctactg 

gcggggggca 

acaccgtggc 

ccggcctgct 

acctgggcga 

ccgtcacctt 

gggggctggt 

acctcttcac 

tgggcagtgg 

actgggcatt 

tggtcccagc 

cctcatggct 

tggccacgtc 

tgcaccgcgc 

ccaaggacag 

cgggggcagg 

gtgccgtggg 

gcagcatcgt 

gggccatcac 

tggccttgca 

gctttatctc 

gcctgggcta 

tcgccactgc 

tggcgatgcc 

cacactccca 

aagctttctg 

gccctggcat 

ccacacggtt 

aagaagacag 

cttgggcaag 

acttgctggg 

agagcaggtg 

ggtatgcagg 

gcaaatgaag 

tcgggttggc 

caggggtgaa 

ggcaggtgca 

aggaccccac 

ccattcccag 

cacaaagctt 

acactggctg 

cccatccatg 

ggggagctta 

tggtgcgggg 

agcttggaca 

taagcaggtg 



ggcgccggcg 
gcgcgggcgg 
taaggcgggc 
caaaggggcc 
agccgccgcc 
aggcgtcctt 
gcagtc^gtg 
ccgcttcaac 
ctccagctcc 
gggcatcggg 
caagaacacg 
cctgggctac 
gcgggtgtcc 
cactaaaagg 
ccgagatatg 
ggctgtctcc 
ccaagttgtg 
cctcatcttt 
agccacgcgc 
catgctgggc 
aggagcctat 
tgcagacatc 
gagcttcacc 
agacctgatc 
cgcgctcatg 
gctcttcttc 
gcccgcatct 
cctcgtctgg 
tgtgatccac 
caagaggagg 
ggacaggttc 
ccccaagtgg 
tccctgccct 
caaagcacga 
gcccaggcct 
gaccactgct 
ctgggcgccc 
tcccagcctg 
ggccctggct 
caccagccca 
tccttccccg 
aatccatggg 
cctgccccag 
ctggcattcc 
cacacaccag 
gccccctgcg 
gtggggtggg 
atgctcttct 
gaatactcac 



ccggggggcg 
cgctggagtc 
cccgaccgga 
ccggcgctca 
atcctcagct 
ctggacatcc 
ttcatctgta 
aggaaggtga 
ttcattcccc 
gaggccagct 
cgtacgctca 
attactggct 
cctgtcctgg 
ggtcatgccg 
aaggccctga 
ttcgccacgg 
cagaagacag 
ggggccatca 
tggtgccgcc 
tctgccatct 
atctgtatct 
ctcatgtacg 
tcccacctgc 
cgccagagca 
ctctgccctt 
gtcagcgacc 
gtgaaagtct 
gaggtgtcct 
ggctaggcac 
ctgtgtcctc 
ccagccctag 
gtgtccgggg 
ccctggaacg 
tctgcagctt 
cagggcggca 
cagctgggcc 
aagtctctgg 
gaggtcccag 
gcagctgtac 
actctgcagg 
aggctgagct 
gcagtagcca 
agctgaggct 
accaagtgac 
gatgcagctg 
tcacccactg 
gggtgaggcc 
tgccccttag 
ccaccaagct 



cagcgagggg 
tcggccgcgg 
ccccccggca 
gcagcGcaaa 
tgggcaacgt 
agcagcactt 
gcttcatggt 
ttctcagctg 
agcagtactt 
actccaccat 
tgctgtccgt 
ccagcgtgaa 
gcatgatcac 
ac cage t egg 
ttcgaaaccg 
gggccctggg 
cagagacgtg 
cctgctttac 
tgaagaccca 
tcatctgcct 
tcgtcgggga 
tggtcatccc 
tgggggacgc 
ctaaggactc 
tcgtcgtggt 
gcgccagggc 
gaggtggtgc 
acagcgtccg 
ccaccctctc 
agttaccctg 
gtttgggccg 
agagcctggc 
aagggccagg 
tgaagactca 
gtcccggctt 
tcggaccttg 
gtactccctg 
atggggactg 
accacctgtg 
gcttctctcc 
gagccttttc 
gggctccggc 
gaggccccgg 
cccaggggcc 
ccaacttcac 
cctgcacttc 
ttgtggccaa 
ttactggctg 
ctggggtacc 



ctggcggtag 
gcgatgaggt 
cccccggcac 
ccggccagct 
gctcaactac 
tggggtcaag 
ggctgccccc 
cggcattttc 
ctggctgctg 
cgcccccact 
cttctacttc 
gcaggcagcc 
aggaacactc 
ggaccagctc 
cagctacgtc 
catgtggatc 
caacagcccg 
gggatttctg 
gcgggccgac 
gatcttcgtg 
gacgctgctg 
cacgcggcgc 
cgggagcccc 
cccgctctgg 
cctgggcggc 
tgagcagcag 
cattgggaca 
ggaccggctg 
tggcccaggc 
gaaggatgtg 
cagggcccct 
ctgccaccag 
gggctggact 
acagaccctg 
tgaggctcac 
gggatattgg 
gaggacactg 
ttctgacaag 
cccccaggct 
ctgccaccac 
caggggcagg 
tgctggagga 
gagaggcggc 
aggccttcga 
accagcccca 
tgctgcaatc 
tgggggaccc 
gctgtggctt 
ccgagggcct 



cggttgctgc 
gcagacgctg 
ccccggctgc 
tgggccgcgg 
ctggacaggt 
gaccgaggcg 
atcttcggct 
ttctggtcgg 
gtcctgtccc 
atcattggcg 
gccatcccac 
ggagactggc 
atcctcattc 
aaggcccgga 
ttctcctccc 
ccgctctacc 
ccctgtgggg 
ggcgtggtca 
ccactggtgt 
gctgccaaga 
ttttctaact 
gccactgccg 
tacctcattg 
gagttcctga 
atgttcttcc 
gtgaaccagc 
atgaagaacc 
ggctgcccca 
ctgctgagtg 
tgtgttggag 

ggggccaagg 

cttatgtgat 
ttcccacaca 
gaccatacgg 
gcgagggcct 
acgcaacctg 
tctcactgtc 
ctggcatcac 
caaggtctct 
cccccaagcc 
gcccaggaga 
agcagctatc 
ccctacccaa 
tcacccacct 
acccgctttg 
aaggtggttc 
cccaagagcc 
cagtggtgtg 
gacaagagga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 
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tggggtgggg gtggcatcct ccaaagacca 
ccccagcgat gttttcttgt tgtacaagaa 
cggaagccaa actgctcctt tattttttag 
gagagaagcc aaatatattc ctcttgtaaa 

<210> 37 

<211> 1986 

<212> DNA 

<213> Homo sapiens 



gcctccaccc ccactccagc ctcagcgggg 3000 
ccaggtccga gtgttgcctc ctcttccttc 3 060 
agctgctgat tgtgaatctc agagtcttaa 3120 
tgaagaaata aacctattta aatcacaaa 3179 



<220> 

<221> miscjeature 

<223> Incyte ID No: 3358383CB1 

<400> 37 



ggagtatctg agcaaattat ttcttacgtg actttagaga aaacggctac ctatctgacc 60 
Icaaaacgac ttgaggaaac tgtttccacg gtcctgctgc aggggggaag cacagtcgtc 120 
aagaagajag tggggtcagg atcaaaacac atttagtgtg acttagggaa agaaaacatt 180 
?tccctcttt glScltctlt ggatacagtc attttgcctc tacttgagga tcaactgttc 240 
aacctcaatg gcctttcagg acctcctggg tcacgctggt gacctgtgga gfttccagat 300 
5c?tcagact |tttttctct caatctttgc tgttgctaca taccttcatt ttatgctgga 360 
gaacttcact gcattcatac ctggccatcg ctgctgggtc cacatcctgg acaatgacac 420 
tgtctctgac latgacactg gggccctcag ccaagatgca ctcttgagaa tctccatccc 480 
altggactca aacatgaggc cagagaagtg tcgtcgcttt gttcatcctc agtggcagct 540 
ccttcacctg aatgggacSt tccccaacac aagtgacgca gacatggagc cctgtgtgga 600 
tggctgggtg tat|acagaa tctccttctc atccaccatc gtgactgagt gggatctggt 660 
atgtgactct caatcactga cttcagtggc taaatttgta ttcatggctg g^atgatggt 720 
gggaggcatc ctaggcggtc atttatcaga caggtttggg agaaggttcg tgctcagatg 780 
gtgttacctc caglttScca ttgttggcac ctgtgcagcc ttggctccca ccttcctcat 840 
ttSctgctca ctlcgcttct tgtctgggat tgctgcaatg agcctcataa caaatactat 900 
tatgttaata gccgagtggg caacacacag attccaggcc atgggaatta cattgggaat 9 60 
gtgcccttct §gtattgcat ttatgaccct ggcaggcctg gcttttgcca ttcgagactg 1020 
gcltatcctc cagctggtgg tgtctgtacc atactttgtg atctttctga ^ctcaagttg 1080 
gctgctagag tctgctcggt ggctcattat caacaataaa ccagaggaag gcttaaagga 1140 
Icttagalal gctgcacaca ggagtggaat gaagaatgcc agagacaccc taaccctgga 1200 
gattttgaaa tccaccatga aaaaagaact ggaggcagca caaaaaaaaa ^^^cttctct 1260 
gtgtgaaatg ctccacatgc ccaacatatg taaaaggatc tccctcctgt cctttacgag 1320 
Ittticaaac tttatggcct attttggcct taatctccat gtccagcatc tggggaacaa 1380 
tgttttcctg ttgcalactc tctttggtgc agtcatcctc ctggccaact gtgttgcacc 1440 
ttgggcacti aaitalatga cccgtcgagc aagccagatg cgtctcatgt acctactggc 1500 
aatctgcttt atggccatca tatttgtgcc acaagaaatg cagacgctgc gtgaggtttt 1560 
ggcaacactg ggcttaggag cgtcggctct gaccaatacc cttgcttttg cccatggaaa 1620 
tgaagtaatt cccaccataa tcagggcaag agctatgggg atcaatgcaa cctttgctaa 1680 
tf tagcagga gccctggctc ccctcatgat gatcctaagt gtgtattctc cacccctgcc 1740 
ctggltcltc tatggagtct tccccttcat ctctggcttt gctttcctcc tccttcctga 1800 
aaccaggaac aagcctctgt ttgacaccat ccaggatgag aaaaatgaga gaaaagaccc 1860 
cagagScca aa|caagagg atccgagagt ggaagtgacg cagttttaag ^^attccagg 1920 
agctgactgc cgatcaatga gccagatgaa gggaacaatc aggactattc ctagacacta 1980 
gcaaat 



<210> 38 

<211> 3294 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 4250091CB1 



<400> 38 

tgtaagacag gaaagggatc tatttgatgt 
aagctattta gttcctcatc tgttgctttt 
ctcgaatttg caaacacagc catggataca 
ctaagtctgg tttttgttat tcttccctcc 
acatttatgg ttataggttg atttcctaag 
gagggaagaa tgagtagtca ggaactggtc 
acaaggtttt ctacgataaa gcagtttcct 



ctatcttcag atatattggc agttttcctt 60 
tcattttgta tactgcaagt tcccaggcaa 120 
ctatttacct tacagtagtt tcctgggaat 180 
cctccactgc ataatcatgt ataactagca 240 
tgtggctgat ggtagcctct agtttgaagt 300 
actttgaatg tgggagggaa gatattcacg 360 
gcttctcgtt tggcacgcat gttagatggc 420 
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agagaccaag aattcaagat ggttggtggc cagatttttg tagacagaga tggtgatttg 480 
tttagtttca tcttagattt tttgagaact caccagcttt tattacccac tgaattttca 540 
gactatctta ggcttcagag agaggctctt ttctatgaac ttcgttctct agttgatctc 600 
ttaaacccat acctgctaca gccaagacct gctcttgtgg aggtacattt cctaagccgg 660 
aacactcaag cttttttcag ggtgtttggc tcttgcagca aaacaattga gatgctaaca 720 
gggaggatta cagtgtttac agaacaacct tcggcgccga cctggaatgg taactttttc 780 
cctcctcaga tgaccttact tccactgcct ccacaaagac cttcttacca tgacctggtt a^u 
ttccagtgtg gttctgacag cactactgat aaccaaactg gagtcaggta ttttgtactt 900 
tgcagtattt ctcttgtata ccagtttgtg atgttttctc taaaaacttg aagttcctca 960 
ggcctgtaac ttctggaaaa gatgattatt caaaataatg ttttggggta accagtggag 1020 
ttgggtagaa tgaccaaata attattttcc aaactgggat actttttaga gtgaaagggg 1080 
ctattattag gtgggacaaa aggaataaat gaagactgcc cagaaaaaac tgagactatg 1140 
gacattcaaa tcatgggaga aaataatttt gtagattatg ttccattgct aatgaatttg 1200 
acttagaaaa gaattgcctt atttttaaga gattgtttca gtggttcaca taaaggctcg 1260 f 
ctcactggtt tctcttgagt tccttacaca ctatataagt tgttctttca gttttatgat 1320, 
tcaactactg tttttccttc agctgacttt atttttaaac acccttaaag acagatatat 1380; 
ctcatggcaa atttggtatc ctgttacagc cttggctctt aaacaactca aaatattggg 1440 
ataggctgtc agtatgttaa ggatagttgc tcctgagtca attcttcact tactccctct 1500 
gttgttcttg gctggatcct aacgctgatt tccactctgc tgtcacaaac atttttcccc 1560 
ccgtaaaatg tcttaatgct gtcctaccat tattttacca actgtgaaag ctggctttaa 1620 
tttttaggag gaaaagaaaa gcctgcatgt gttctttatt ggtatcattt aaaatatact • 1680 
tttttttttt ttttggtaaa ggtaggcgta ttttaagata ttttcttaac ttgagcagta 1740 
gccaacagga aggataccag tgtctctctc tcttagcgac acactccttg gtcttgctta- 1800 
ccaactggag gacactaggt agaataaccg agtatgacaa ttcttaattg tttacatttt IbbU 
ataacttcct gtccttcaaa agagtttgaa atgtcatttt gggaaaagag agccagtcaa 1920 
gctagtaggc tgattgtgaa gaaaatctaa taccttatct ttatctcaaa cctctgtaca 1980 
actttatttt cattgatggg atactttaac aaaaatgaaa ttttttttgg tttttaaaat 2040 
atgagtgatt atgacctctt tggggatcat gcttcaaaaa gtcagaaacc tagagacaaa 2100 
actgtcattg atttttaaga agaaacacac taggtcaaaa gaagatgtcc tggaaatacg 2160 
aagtactctt taaaaaccat gcatttggag aaagtaattg tttccttgaa aaacatgatt 2220 
aaaaactaaa actgggatgt tcctgtgtgt acacagtgcc aaatggtttt ccctttttat 2280 
gttgtgtttt agaaacagca cgaaagtttt ttccatttta aagtgagaaa acattatatt 2340 
tagacttcca taattccaaa atcagaagct atttttaaaa ttagcatttt cttgcatcac 2400 
caaatggtat tcaattgttt gaagctcaaa atttttacca ttccataaat gtttgtgaat 2460 
ttttagacag tgccaattta aaagtagaga tagccaatct gaatacggtg aaattatggg 2520 
gatctctggt gattgggatg aaaactctgg ccttaaaagg tccactttta gtatataatt 2580 
gcctaattag caatcatttt tattttttgc tcactccctg gtctgaatct atctgtctat 2640 
tcagatattt tttggtaggt ttggaaaatg gagaagtgag cctaattggt gcctaattgt 2700 
ctggtgtatc attcacttta ttcagtttgt tctatcaata tgatttaccc ctcaaggtta 2760 
acctagcagg ttgctcagtt attatctctc aaggtcacag tactagaaat acttggcttg 2820 
catctttcag atgccattca tgttatcaag ctcaaattat agttggtcac aggattctaa 2880 
agtctttatt tgacttctcc tttttgaact ggctcaaatg gaaaagtgta gttgctttta 2940 
aatgttaaaa ataagtttaa actttatatt tcccattggt ttcccctatt ttgtcctttc 3000 
tttgtgtgct tgaaatattt tatttttcag tttgtcctca tagggaatca agtattttag 3060 
ctaggtgatg tcttgcaagt acgttccact ttgttacaat ctactatctg tatatactat 3120 
ttgtatctta attcttttat gagatgttct gtaacatttt tctcactttg acaaatgttt 3180 
ttagactgta cagtcaagat ctggcgcttg ggggtaagtg gaatgatttg ctaatattga 3240 
gaatctgttg tatcaaacat aataaacttt ttttgagatg tgaaaaaaaa aaaa 3294 



<210> 39 
<211> 2043 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> inisc_f eature 

<223> Incyte ID No: 70064803CB1 



<400> 39 

gcaacatggc ggctgccgtg gtgcagcgcc 
cctaccccgg gtgaggggtg gcctccgcgt 
ccccgacatc acgtgtattc cgcacgtccc 
ggcgtgacag ggcccgggtc ccttctcagt 
tctccgggcg cacttccctc gcctgtgttc 
ctcgcaggtg ggatcgtcgg tgggaccgga 
ggccgggtcc gacaccgcgc ccttcctcag 
gcctggcacc ccggggttgc cagggtccac 



cgggctgagc gacagcaagt gcagcgggct 60 
gggatcgtgc cctcttcagc ccgctcctgt 120 
ctccgcgctg tgtgtctact gagacgggga 180 
ggtgctctgt gcttcagggc aagctccccg 240 
ggtccatcct cctttctcca gcctcctccc 300 
gcgcgggcgg gcgcggcccc ccgggaccat 360 
ccaggcggat gacccggacg acgggccagt 420 
ggggaacccg aagtccgagg agcccgaggt 480 
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wo 01/46258 
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cccggaccag gaggggctgc agcgcatcac 
agtggcggtg ctgtgctaca tcaatctcct 
cgtccttccc gacatcgagc agttcttcaa 
gaccgtgttc atctccagtt acatggtgtt 
gtacaatcgg aagtatctca tgtgcggggg 
gtcatccttc atccccggag agcatttctg 
ggtcggggag gccagttatt ccaccatcgc 
cgaccagcgg agccggatgc tcagcatctt 
gggctacatt gcaggctcca aagtgaagga 
ggtgacaccg ggtctaggag tggtggccgt 
gccaagggga gccgtggagc gccactcaga 
ggcagatctg agggctctgg caagaaatct 
agtcctgggt gtgggcctgg gtgtggagat 
ggctgatccc ctggtctgtg ccactggcct 
ccttgcctgc gcccgtggta gcatcgtggc 
cctcctgtcc atgaactggg ccatcgtggc 
ccgacgctcc accgccgagg ccttccagat 
gagcccctac ctcattggcc tgatctctga 
cttgtccgag ttccgggctc tgcagttctc 
gggcggcgca gccttcctgg gcaccgccat 
gctgcacgtg cagggcctgc tgcacgaagc 
ccagcggggc cgctccaccc gcgtgcccgt 
tcacctacct gcacatctgc cacagctggc 
taaccccttg gcctggccca gcttccagag 
actacatggg tagctcaggg gaggaggtgg 
ggcagcccca agggctcggt gctatttgta 
aaa 



cggcctgtct cccggccgtt cggctctcat 540 
gaactacatg gaccgcttca ccgtggctgg 600 
catcggggac agtagctctg ggctcatcca 660 
ggcacctgtg tttggctacc tgggtgacag 720 
cattgccttc tggtccctgg tgacactggg 780 
gctgctcctc ctgacccggg gcctggtggg 840 
gcccactctc attgccgacc tctttgtggc 900 
ctactttgcc attccggtgg gcagtggtct 960 
tatggctgga gactggcact gggctctgag 1020 
tctgctgctg ttcctggtag tgcgggagcc 1080 
tttgccaccc ctgaacccca cctcgtggtg 1140 
catctttgga ctcatcacct gcctgaccgg 1200 
cagccgccgg ctccgccact ccaacccccg 1260 
cctgggctct gcacccttcc tcttcctgtc 1320 
cacttatatt ttcatcttca ttggagagac 1380 
cgacattctg ctgtacgtgg tgatccctac 1440 
cgtgctgtcc cacctgctgg gtgatgctgg 1500 
ccgcctgcgc cggaactggc ccccctcctt 1560 
gctcatgctc tgcgcgtttg ttggggcact 1620 
cttcattgag gccgaccgcc ggcgggcaca 1680 
agggtccaca gacgaccgga ttgtggtgcc 1740 
ggccagtgtg ctcatctgag aggctgccgc 1800 
cctgggccca ccccacgaag ggcctgggcc 1860 
ggaccctggg ccgtgtgcca gctcccagac 1920 
gggtccagga gggggatccc tctccacagg 1980 
acggaataaa atttgtagcc agac^aaaaa 2040 



<210> 40 

<211> 1915 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7 0356768CB1 



<400> 40 

caccactggg cgctgcgcgc tgcccttccc 
ctaatggcag ccggggtctc cctgggacag 
tgctccacgc cctcctgcga tctagaatga 
gcgctgccgg cattggtggg acttttcagt 
cgaccttgca cattcaggaa ttcaccaatg 
tgcccgatca cctagtcctg cttatgtggt 
gcctctttgg agcactgctt gcaggtccct 
tcctggtgaa taacatcttt gtggtgtcag 
caggctcctt tgagatgatc atgctgggaa 
gcatgaacat ccagcccatg tacctggggg 
tggccatgag ctcagccatc tttacggctc 
tcagggagct cctaggtggc cctcaggcct 
ccggggcgct ccagctcgcc tccctgcctc 
ttgactgtgg agacaccgag gcctgcctgg 
agtgctgtgg ctgttcatag gcatgacccc 
ttttgttttt gtttttttgg ttttgtttgg 
agtggtgtga tctcggctcg ctgcagcctc 
tcagcctcct gggtggctgg gactgctggc 
tattattgta ttttttgtaa agatggagtt 
cctgagatca aatgatcctc cccccttggc 
ccattgtatc tggctagcat gggagttttg 
attcctcctt aggcagcctg gtggtccctg 
cttagtgcag acacctgatc tgcctagcgt 
atcctcctgt ctcagcctcc tgagtagctg 
tctccatgct tcttgggtct accctctgag 
ttccttctga agagggcgtt gcacaatgtg 
cctccttcct gcctatagag agttggggca 
agggaggctg aggcgggcag atcacgaggt 
gtgggacccc atctctacta acaatacaaa 



tccgcgcaca ggctgccggc tcaccgcttg 60 
caagacctcc gctcaggccc ctctttcgaa 120 
ttcagggcag gatcctgctc ctgaccatct 180 
ttggctataa cctctctatc atcaatgccc 240 
agacatggca ggcgcgtact ggagagccac 3 00 
ccctcatcgt gtctctgtat cccctgggag 3 60 
tggccatcac gctgggaagg aagaagtccc 420 
cagcaatcct gtttggattc agccgcaaag 480 
gactgctcgt gggagtcaat gcaggtgtga 540 
agagcgcccc taaggagctc cgaggagctg 600 
tggggatcgt gatgggacag gtggtcggac 660 
ggcccctgct gctggccagc tgcctggtgc 720 
tgctccctga aagcccgcgc tacctcctca 780 
cagagacggg ttctcgcttg tccaggctgg 840 
attgttgatc agcacggaag ttttcttctt 900 
gacggggtct cactctgtcg cccaggctgg 960 
cacctcccgg gcccaatcgg ttctcccgcc 1020 
ccgtgccacc acgcttggct aatttttttt 1080 
tcacctcttt gcctgggcag gtctcaaact 1140 
ctcccaaagt gcgtggatta taggcatgag 1200 
aactgtccca tttccaacct gggccagtgc 1260 
ctcctgggat gtcactatat tgatgctgaa 1320 
actgcaaccc agagctcctg ggcccaggcg 1380 
ggactctagg cacacaccac tatgcgtggc 1440 
atgtttttcc ttttctttca ccttccttga 1500 
ctgcttttga tggttgagca aatttctcag 1560 
ggctgggcgc cagctcacgc ctgtaatccc 1620 
caggacatca agaccggcct ggccgacatg 1680 
aattggctgg gtatggtggc acgtgcctgt 1740 
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ggtcccggct gctggggagg ctgaggcggg agagttgctt gggcccggga ggcggaggtt 1800 
gcagtggcgg gagaattgct tggggcccgg gaggcggagg ttgcggtgag ccgagattgt 1860 
gccagtgcac actgcactcc agcctggtga cagagtgaga ctccgtcttc aaaaa 1915 

<210> 41 
<211> 1809 
<212> DNA 

,<213> Homo sapiens 
<220> 

<221> misc^feature 

<223> Incyte ID No: 5674114CB1 

<400> 41 

atgggcctgg ccagggccct 
gcgggcgatg aagaggaggc 
gtgcagtcac ccgtgggccg 
gtgcgtttcg taaacctggg 
tgcgtggacg tgcgctggcg 
tggctgctct tcggcctggc 
ccgccaccgc ccgcgccctg 
gcgctggaga cgcagacgtc 
gccgctgtgg ccgccgtggt 
gtgggtgctg tcatggccaa 
agcgagaacg ccgtcgtggc 
aacctgcgcc gcagccacct 
gtgaccccag agggtgagta 
ggaggcaccg atcgtatctt 
gccagtcctc tgtatgagct 
gtcattctcg aggggatggt 
ctccctggtg aactgctctg 
cagtatgagg tcgactatcg 
tgcagtgcta aggagctgga 
ttccccggct ctctgactgc 
gaggaagatg aggacgatga 
gctagccccc gagttctcac 
atgtcccctt ccccgtgtat 
tctcctggga tgggggcagg 
gtggtaaggt tctgccatgc 
gcttctgtgc tccctcctga 
ctggatcacc caagaggaaa 
attgttgacc agggtgagaa 
ttcttgaaca gcatcagata 
agtgttgctt ctcaactcag 
agggaaggt 

<210> 42 
<211> 1730 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 1254635CB1 

<400> 42 

ctttggccta ttataccatg gatgctaaaa atggttctaa ctgaaaaccc aaaccaagaa 60 
atagcaacaa gtctagaatt cttactacta caaaactcac ctggatccct aagggcacag 12 0 
caaagaatga gctattacgg cagcagctat catattatca atgcggacgc aaaataccca 180 
ggctacccgc cagagcacat tatagctgag aagagaagag caagaagacg attacttcac 240 
aaagatggca gctgtaatgt ctacttcaag cacatttttg gagaatgggg aagctatgtg 3 00 
gttgacatct tcaccabtct tgtggacacc aagtggcgcc atatgtttgt gatattttct 3 60 
ttatcttata ttctctcgtg gttgatattt ggctctgtct tttggctcat agcctttcat 420 
catggcgatc tattaaatga tccagacatc acaccttgtg ttgacaacgt ccattctttc 480 
acaggggcct ttttgttctc cctagagacc caaaccacca taggatatgg ttatcgctgt 540 
gttactgaag aatgttctgt ggccgtgctc atggtgatcc tccagtccat cttaagttgc 600 
atcataaata cctttatcat tggagctgcc ttggccaaaa tggcaactgc tcgaaagaga 660 



39/49 



acgccgcctc agcggcgccc 
cgggcccggg ttgtgccgca 
gcgccgcggt cgcttcgtca 
tggccagggc gcgcgctacc 
ctggatgtgc ctgctcttct 
cttctggctc attgcctcgc 
cttctcacac gtggccagct 
catcggctac ggcgtgcgca 
gctgcagtgc attgccggct 
gatggccaaa cccaagaagc 
gctgcgcgac caccgcctct 
ggtcgaggcc cacgtgcgtg 
catcccgctg gaccaccagg 
cctcgtgtcc cccatcacca 
aggacgtgcc gagctggcca 
tgaggccaca gccatgacca 
gggccatcgt tttgagccag 
ccacttccat cgcacttatg 
tgaacgggca gagcaggctt 
attttgttat gagaatgaac 
gactgaggaa gggaatgggg 
accaaccctg gcgctgaccc 
gcccccttcc ccaaggtagc 
tgttcctgaa taccgacagg 
ctggtgaccc accatggaca 
gaacccttta tgagcctgat 
agactggcag ttctagattc 
gccaatggta tagactgcct 
tcaagagttt gtaggtctgg 
ccaactgagt agcaaatcat 



tggattcggg agacagccgg 60 
acgggtgggc gccggcaccg 120 ' 
agaaagacgg gcactgcaac 180 
tgagcgacct gttcaccaca 240 
cctgctcctt cctcgcctcc 3 00 
tgcacggcga cctggccgcc 360 
tcctggccgc cttcctcttc 420 
gcgtcaccga ggagtgcccg 480 
gcgtgctcga cgccttcgtc 540 
gcaacgagac gctggtcttc 600 
gcctcatgtg gcgcgtcggc 660 
cccagctgct gcagccccgt 720 
atgtggatgt gggctttgat 780 
tcgtccatga gatcgactct 840 
gggctgactt tgagctggtg 900 
cacagtgtcg ctcgtcctac 960 
ttctcttcca gcgtggctcc 1020 
aggtcccagg gacaccggtc 1080 
cccacagcct caagtctagt 1140 
ttgctctgag ctgctgccag 1200 
tggaaacaga agatggggct 1260 
tgcctccatg atgcaaactg 1320 
aagatggagg gatggggctc 1380 
cctgctgggt aaatgactag 1440 
tactggacct taattcctct 1500 
tcctcagtct caccagaatt 1560 
ctctatatgg ggagacctgg 1620 
ctggggaagc aagttggcag 1680 
attcacctaa gattcaaggg 1740 
ttgttctaga ccacctaagg 1800 

1809 



wo 01/46258 



PCT/USOO/35095 



gcccaaacca ttcgtttcag ctactttgca cttataggta tgagagatgg gaagctttgc 720 
ctcatgtggc gcattggtga ttttcggcca aaccacgtgg tagaaggaac agttagagcc 780 
caacttctcc gctatacaga agacagtgaa gggaggatga cgatggcatt taaagacctc 840 
aaattagtca acgaccaaat catcctggtc accccggtaa ctattgtcca tgaaattgac 900 
catgagagcc ctctgtatgc ccttgaccgc aaagcagtag ccaaagataa ctttgagatt 960 
ttggtgacat ttatctatac tggtgattcc actggaacat ctcaccaatc tagaagctcc 1020 
tatgttcccc gagaaattct ctggggccat aggtttaatg atgtcttgga agttaagagg 1080 
ciagtattaca aagtgaactg cttacagttt gaaggaagtg tggaagtata tgcccccttt 1140 
tgcagtgcca agcaattgga ctggaaagac cagcagctcc acatagaaaa agcaccacca 1200 
gttcgagaat cctgcacgtc ggacaccaag gcgagacgaa ggtcatttag tgcagttgcc 1260 
attgtcagca gctgtgaaaa ccctgaggag accaccactt ccgccacaca tgaatatagg 1320 
gaaacacctt atcagaaagc tctcctgact ttaaacagaa tctctgtaga atcccaaatg 1380 
tagtcctaaa ttgcaattat gagggctacc actgaatcat tttatctttc agccaatcaa 1440 
gtcgttgtaa acgtggcttt tttgaaagtg ttatggctat gttttatgat gatgctgggt 1500 
aagtagagta agttaaactt ggtaaaagat aatctaaaaa ttccatagtt ctcagttatt 1560 
aaaatttttc ttgttcgcca attttgtatt aagaatgcta ttaagcctaa ttgattaaaa 1620 
tttatctttt ttattatctt acatgcttgt atcttcagtt ggaggtgtag tattcaaaaa 1680 
cggggasitga aggcaggaag gaggctggaa taaataaaaa taaaatgatt 1730 

<210> 43 
<211> 1147 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> inisc_feature 

<223> Incyte ID No: 1670595CB1 

<400> 43 

gcagctgtct tttccggccc ccgtgcactc tccgcccgag gcggagcccc cggctcgcgg 60 

ggatcgcccc cgagcgctgc gtcctgcggg tgggtcacct aacccatttg tggcttcctc 120 

tacctgtgct cagccatggc cagcgagagc tcacctctgc tggcctaccg gctcctgggg 180 

gaggaggggg ttgccctccc tgccaatggg gccgggggtc ctggaggggc gtctgcccgg 240 

aagctgtcca ccttcctggg tgtggtggtg cccactgtcc tgtccatgtt cagcatagtt 300 

gtttttctga ggattgggtt cgtggtgggt catgctgggc tactgcaggc cctggccatg 360 

ctgctggttg cctacttcat cctggcactc accgtcctct ctgtctgtgc catcgccacc 420 

aatggagccg tgcagggggg cggagcctac tgtatcctcc aacatcgatg gactgggatg 480 

ccacagggcc cagtgggctc cgggtcctgc cccagggcta cggcttggaa cctgctgtat 540. 

ggctccctgc tgctgggcct tgtgggtggg gtctgcacct tgggagccgg cctctatgcc 600. 

cgggcctcat tcctcacatt cctgctggtc tctggctccc tggcctctgt gctcatcagt 660. 

tttgtggctg tggggccgag ggacatccgc ttgactccta ggcctggccc caatggctcc 720 

tccctgccgc cccggtttgg ccacttcacc ggcttcaaca gcagtaccct gaaggacaac 780 

ttgggcgctg gctatgctga ggactacacc acgggagccg tgatgaattt tgccagcgtc 840 

tttgctgtcc tctttaacgg caggcatcat ggctggggcc aacatgtcag gggagctgaa 900 

ggaccccagc cgggcgatcc ctctgggcac gatcgtcgcc gtcgcctaca ccttcttcgt 960 

ctatgccctg cttttctttc tctccagcct cccttcactg gtgccttgat gctaggggcc 1020 

aggcctcctc tgtgactctg ggctacctca gtttccccat tttggccaga ctcaccggcc 1080 

caccggggtg gtgatgtttt cgttctgttt tatttttcta actctgcatg accatgaata 1140 
aaagacc 1147 

<210> 44 
<211> 2745 
<212> D]SIA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 1859560CB1 

<400> 44 

cggcgacgcc agggacccca cgcatcccga gtgaagcaac tagaactcca gggctgtgaa 60 

agccacaggt gggggctgag cgaggcgtgg cctcaggagc ggaggacccc ccactctccc 120 

tcgagcgccg cagtccaccg tagcgggtgg agcccgcctt ggtgcgcagt tggaaaacct 180 

cggagccccg ctggatctcc tggctgccac ccgcaccccc cgccagccta cgccccaccg 240 

tagagatgcc ttcttcggtg acggcgctgg gtcaggccag gtcctctggc cccgggatgg 3 00 

ccccgagcgc ctgctgctgc tcccctgcgg ccctgcagag gaggctgccc atcctggcgt 3 60 

ggctgcccag ctactccctg cagtggctga agatggattt cgtcgccggc ctctcagttg 420 
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gcctcactgc cattccccag gcgctggcct 
atggcctcta ctctgccttc atgggctgct 
atgtgactct gggccccacc gccattatgt 
agcccgccta cgctgtgctg ctggccttcc 
tcctgcgttt ggggttcctg ctggacttca 
ctgctgctgc cgtcaccatc ggctttggac 
tccccaggcc gttcttcctg caggtgtacc 
taggtgacgc cgtcctgggg ctggtctgca 
gggaccacgt gcctcccgtc caccccgaga 
tggtctgggc tgccacgaca gctcgcaacg 
cgtactcctt cgaggtgact ggataccagc 
ggctccctcc agtccggatc ccgcccttct 
tcaccgagat ggtgcaggac atgggagccg 
tggagagcat tgcggtggcc aaagccttcg 
accaggagct gctggccatc ggtctcacca 
cggtcacagg cagctttgga cggacagccg 
cggggggcct ggtgacggga gtgctggtgc 
tctactacat ccccaagtct gccctggctg 
tcgacaccaa gatcttcagg acgctctggc 
gcgtgacctt cctgctgtgc ttctgggagg 
tgtctctgct catgctcctg cactctgcag 
cggttctggt cctgcagccg gccagcggcc 
aggagatcct aagccgggcc ctggaagtgt 
cccatgtctg cagcatcgac tacactgtgg 
tccagaagca gggcgtcgcc ctggcctttg 
tgctgtccgc tgacctgaag gggttccagt 
acctgaggca ggagccaggg acccagccct 
aaaaggttgc cctgctcaag gcataatggg 
tgttccggaa ggttcttgtc actgtgattg 
gcctggctga gaaacccctg agcaggtaac 
gtccacggca gtgggagtgg ggctcactgg 
gctgctgttc cctggcatga ccctctttgg 
gacagactgt gcgaggaagc aggggcaggg 
ggggctggca gcaccttccc ggctcaccag 
aggagtctgg gaggcgggtc cgctcctctt 
accaaggtgt gtcaaatgac gtcaagtctc 
ggaaaaagtg atagctttgg tgattttgta 
aggaaaccac ccctcaccct gtccacttgg 
catatgtacc tgtccgtcag tgtgtggatg 

<210> 45 

<211> 3204 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 5530164CB1 

<400> 45 

cgacctctgg agctactgcg cctgcaagcc 
agctggcgcc gcacaaatac ggggcgggac 
agggcctgag ccgcacagcc cgcccagggg 
gaatggttgc gtgtttccgt gtgtgggtcc 
ctgacaccac cgcccaacta tgaactcatc 
cgccgcgcgc actcgcgcac gagtgagatc 
gcagcctgag actcgacggg agggggtcac 
cactaggaaa gcacctcccc ctttccaaaa 
agatggccgg ggcagtcggc acgagggagg 
gggaagatgg cggatgacaa ggattctctg 
aaccagctgg aaagcctgca gcggcgtgta 
gatggctcgc tgttgtcctc cccgttcctc 
aaactgaggg catcagcagt attgggcttt 
gctcaggcat atgctgtgcc caacgtggag 
cgcaaggggc ccgactagct ctaggtgcca 
caggtggaga cactttatct ggattcccca 
ctgccacctt catccttgcc tcccttcctg 



atgctgaagt ggctggactc ccgccccagt 480 
tcgtgtattt cttcctgggc acctcccggg 540 
ccctcctggt ctccttctac accttccatg 600 
tgtccggctg catccagctg gccatggggg 660 
tttcctaccc cgtcattaaa ggcttcacct 720 
agatcaagaa cctgctggga ctacagaaca 780 
acaccttcct caggattgca gagaccaggg 840 
tgctgctgct gctggtgctg aagctgatgc 900 
tgccccctgg tgtgcggctc agccgtgggc 960 
ccctggtggt ctccttcgca gccctggttg 1020 
ctttcatcct aacaggggag acagctgagg 1080 
cagtgaccac agccaacggg acgatctcct 1140 
ggctggccgt ggtgcccctg atgggcctcc 1200' 
catctcagaa taattaccgc atcgatgcca 1260| 
acatgttggg ctccctcgtc tcctcctacc 1320, 
tgaacgctcai gtcgggggtg tgcaccccgg 1380 
tgctgtctct ggactacctg acctcactgt 1440 
ccgtcatcat catggccgtg gccccgctgt 1500 
gtgttaagag gctggacctg ctgcccctgt 1560 
tgcagtacgg catcctggcc ggggccctgg 1620 
ccaggcctga gaccaaggtg tcagaggggc 1680 
tgtccttccc tgccatggag gctctgcggg 1740 
ccccgccacg ctgcctggtc ctggagtgca 1800 
tgctgggact cggcgagctc ctccaggact 1860 
tgggcctgca ggtccccgtt ctccgtgtcc 1920 
acttctctac cctggaagaa gcagagaagc 1980 
acaacatcag agaagactcc attctggacc 2040 
gccacccgtg ggcatccaca gtttgcaggg 2100 
gatgctggat gccgcctgat agacatgctg 2160 
ccagggaaga gaaggaagcc aggcctggag 2220 
cttcctgtgg gatgactgga aaatgacctc 2280 
aagagtggtt tggagagagc cttctagaat 2340 
gtttccagcc cgggctgtgc gaggcatcct 2400 
tgccacctgc gggggaggga cggggcaggc 2460 
gtctgcggca tctgtgctct ccgagagaaa 2520 
tatttaaaaa taattttgtg ttttctaaat 2580 
aaagtcataa atgcttattg taaaaaatac 2640 
gtgatcattc cagacccctc cccaaacatg 2700 
tatgtttaca gttct ' 2745 



cagcctctct gcgccgcagg ctgcggggcc 60 
acggggcggg acacgggccg gtcccggggg 120 
tggtgcgtgt aaacgggcgt ctggatcccc 180 
gggggaggcc cacgaacgcc agcgaaaccg 240 
aggcgcctga agaccgacac gccgaacatg 300 
atcgcgcccc ggtcgtgagt gcgctcacac 3 60 
gtggaagtat ctgagagagg cgtacttggc 420 
atgctccgga agtgccttcg ccctccgtaa 480 
cggggatgcg cctgcgcaac aagttcggcg 540 
cctaagctta aggacctggc atttctcaag 600 
gaagacgaag tcaacagtgg agtgggccag 660 
aagggattcc tggctggcta tgtggtggcc 720 
gctgtgggca cctgcactgg catctatgcg 780 
aagacattaa gggactattt gcagttgcta 840 
tggaagaggc aggatgagca gctcagcctt 900 
gctgtcatcc atttgctatc tccaactttc 960 
cagattgtgg acagtagttc ctcagcctgc 1020 
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accctggatt ccttcttccc cttcctagct ccatgggact cgccccaaga ctgtggcttc 1080 
aaggaccacc agccccttac tcttcaagcc ctgactgtgg agttggtaga tgcctctgat 1140 
cctcagtatt ctctctggca atgttccacg gcttctcctt cctgggagct ggctccataa 1200 
cttgattttc cccaaacgtg ttgcaatccc tgctgcccct tagccaccca gggtcttgtg 1260 
tgggtatgag tgtagaggat gggggtatgc caggcctggg ccgtcccagg ^aggcccgct 1320 
giaccctgat gctactccta tccactgcca tgtacggtgc ccatgcccca ttgctggcac 1380 
tgtgccatgt ggacggccga gtgcccttcc ggccctcctc agccgtgctg ctgactgagc 1440 
tS?caag?t Ictgttatgc gccttctccc . ttctggtagg ctggcaagca Jggccccagg 1500 
gicccccacc ctggcgccag gctgctccct tcgcactatc agccctgctc tatggcgcta 1560 
acaacaacct ggtgatctat cttcagcgtt. acatggaccc cagcacctac caggtgctga 1620 
gtaatctcaa |attggaagc acagctgtgc tctactgcct ctgcctccgg caccgcctct 1680 
ctgtgcgtca ggggttagcg ctgctgctgc tgatggctgc gggagcctgc tatgcagcag 1740 
ggggccttca Igttcccggg aacacccttc ccagtccccc tccagcagct gctgccagcc 1800 
ccltgcccct gcatatcact ccgctaggcc tgctgctcct cattctgtac tgcctcatct 1860 
caggcttgtc gtcagtgtac acagagctgc tcatgaagcg acagcggctg c^^^^ggcac 1920 
ttcagaacct cttcctctac acttttggtg tgcttctgaa tctaggtctg catgctggcg 1980 
gcggctctgg cccaggcctc ctggaaggtt tctcaggatg ggcagcactc ^tggtgctga 2040 
Iccaggcact aaatggactg ctcatgtctg ctgtcatgaa gcatggcagc agcatcacac 2100 
gcctctttgt ggtgtcctgc tcgctggtgg tcaacgccgt gctctcagca gtcctgctac 2160 
igctgcagct cacagccgcc ttcttcctgg ccacattgct cattggcctg gccatgcgcc 2220 
tgtactatgg cagccgctag tccctgacaa cttccaccct gattccggac cctgtagatt 2280 
gSgcgccacc accagatccc cctcccaggc cttcctccct ctcccatcag cagccctgta 2340 
acaagtgcct tgtgagaaaa gctggagaag tgagggcagc caggttattc tctggaggtt 2400 
ggtggatgaa gggitacccc taggagatgt gaagtgtggg tttggttaag gaaatgctta 2460 
clatScccca cccccaacca agttcttcca gactaaagaa ttaaggtaac atcaatacct 2520 
aggcctgaga aataacccca tccttgttgg gcagctccct gctttgtcct g^^^gaacag 2580 
agttgatgaa agtggggtgt gggcaacaag tggctttcct tgcctacttt agtcacccag 2640 
cagagcclct gga|lt|gct agtccagccc agccatggtg catgactctt ccataaggga 2700 
tcStcaccct tccactttca tgcaagaagg cccagttgcc acagattata caaccattac 2760 
ccaaaccact ctgacagtct cctccagttc cagcaatgcc tagagacatg ctccctgccc 2820 
tctccacagt gctgctcccc acacctagcc tttgttctgg aaaccccaga g^gggctggg 2880 
cttgactcat ctcagggaat gtagcccctg ggccctggct taagccgaca ^tcctgacct 2940 
ctctgttcac cctgagggct gtcttgaagc ccgctaccca ctctgaggct cctaggaggt 3000 
accatgcttc ccactctggg gcctgcccct gcctagcagt ctcccagctc ccaacagcct 3060 
ggggaagctc tgcacagagt gacctgagac caggtacagg aaacctgtag ^tcaatcagt 3120. 
gtctctttaa ctgcataagc aataagatct taataaagtc ttctaggctg tagggtggtt 3180 
cctacaacca cagccaaaaa aaaa 

<210> 46 

<211> 2763 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: 139115CB1 



<400> 46 

tgcatttgct atgactttga ccggtccact 
ggaagaaact ggcaactaca ctttttcatc 
caaaagcagc ccaatttttg cattccagga 
tctgcagatg gacataagtg gattaattcc 
tattagtgat cactacggac gaaaattccc 
aaccagcgtt tggctctgtt tgctttgcta 
atctaccttc attggtgcat tttgtggcaa 
ctatatagtt gatcagtgta aagaacacaa 
ctttctactt ggacttgtta ctggactaac 
gctaggtttt gagtggtcgt ttctaattat 
tattttattt tttctcggag atccagtgaa 
atgtagtgaa ggcttcaaaa acctatttta 
tggtaagaga cgatttttgc tctgtttgtt 
ggtaattggc attgccccaa tttttatcct 
tgaagttttt ataggttatg gatcagcttt 
aggaatatgg cttttttctt attgtatgga 
taccacgatg acaggaatgg ctatgaccgc 
agccagggtg ccgttccttt tcactattgt 
aaaagtggtt cgttcgactg aacaaggtac 



gacaacgcaa tatgtttatc ggagaatatg 60 
tgatagcaat atttctgagt gtgaaaaaaa 120 
ggaagttcag aaaaaagtgt cacgttttaa 180 
tggtctagtg tctacattca tacttttgtc 240 
tatgattttg tcttccgttg gtgctcttgc 300 
ttttgccttt ccattccagc ttttgattgc 360 
ttataccaca ttttggggag cttgctttgc 420 
acaaaaaaca attcgaatag ctatcattga 480 
aggactgtca tctggctatt ttattagaga 540 
tgctgtgtct cttgctgtta atttgatcta 600 
agagtgttca tctcagaatg ttactatgtc 660 
ccgaacttac atgcttttta agaatgcttc 720 
actttttaca gtaatcactt atttttttgt 780 
ttatgaattg gattcaccac tctgctggaa 840 
gggtagtgcc tcttttttga ctagtttcct 900 
agatattcat atggccttca ttgggatttt 960 
gtttgccagt acaacactga tgatgttttt 1020 
gccattctct gttctacggt ccatgttgtc 1080 
cctgtttgct tgtattgctt tcttagaaac 1140 
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acttggagga gtcactgcag tttctacttt taatggaatt tactcagcca ctgttgcttg 1200 
gtaccctggc ttcactttcc tgctgtctgc tggtctgtta ctacttccag ccatcagtct 1260 
atgtgttgtc aagtgtacca gctggaatga gggaagctat gaacttctta tacaagaaga 1320 
atccagtgaa gatgcttcag acaggtgact gtgatttaaa caaacaaaaa aaatctatga 1380 
atgcacatat catataccat gacttctgaa gactataaat gaattccaca atcagtgctt 1440 
cactgagaac caattttacc tatcttttct tctaaactga acagtcagag agacagctcc 1500 
tggctttagc ttcttgtggt accacgcact ttgagcactt tgtgcgtatc atgcaatata 1560 
cttgcaatac acagaacaaa tttcaaatac gcctcacttt tagacttaga agagaaacat 1620 
taaaacttaa gggtgtaagg agggatcaag aaacttgata aggtcaaaag caataatctc 1680 
tctgacatat tccaggctct tacactgaga ccaaagagaa atctttacct cagtttcttc 1740 
atcagcagaa tgggtttctg gcctctctca gggataattt tgaaggcata atgaaaatta 1800 
tgatgaatca ctcattggta ggaaaataat gatataagtt tcaaatatgt atgattttac 1860 
ctatacttgg taatgctttg ttttatagag cctgttaagc tgctattgat agtcggagct 1920 , 
tatatactgt gacttctgaa gactatacat gaattccaca atcagtgctt tgttgataca 19801 
aaatccttaa aagggaggca ctttaaagaa tatgtatttt tcacttttct taatatgttt 2040 
catcggtgac aggcatgata atatttctat atgtaatggg taattgggaa aaaatagatg 2100| 
ataaataaaa ttgctctaaa gaagttaaaa aactgaatga acagctaata ctggtataaa 2160 
gtaactaatg tttggagcca acatttgttc cttgtgtcag caaaaggata ttcacattcc 2220 
atgatccctg gctgagaatt ctgcctctag tctttcttac ccagctgttg tctatccttg 2280 
ttcaattata aatactgcta agggcatttt taaaatacga tcttgtagtc cttaaatttg 2340 
aatccgtcag cacggtcact cataggaaaa tgatcaaaca agcaagccag tcatgatttg 2400 
actccttccc atctcatttc ttactgcctt acgctcatcc tgaggtccac cttggtctct 2460 
aaaaacacca tgtgttctca tgcctccatg tcttttcaca cactgttcca tttgctcttc 2520 
ctcccacatt acattgaaac tttcaagcct cagtcgaaac attgcttctt ctggatagca 2580 
gccttcttga catccctcct cactccccag tccctacagg gcttccatag ctctttgtgt 254U 
gcacttcgat cccagcattt tccatcgact tgtaattgtt tctgctacct gacaatcatc 2700 
gccttgagta ctgggacaac ctttgattac tcattatatc ctcaataaat atttgttgaa 2760 
eta ^^^^ 



<210> 47 
»<211> 163 9 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1702940CB1 

<400> 47 

atcgcactga ggcttgagtc tgacttctct 
gatcggggcg ggggttgggg ggcaagcggc 
aactctggtt ctgactgcct gagacatggg 
gatgtgaggc agggtatatc tgggaggccg 
cagctggacg tgggatccac acagctcaga 
ggaagatccc ttggacaaga ggaccctgcc 
gaacgagggt taaggaaaac cttccagtct 
ccctcggtaa cccagccgct ggcaccatga 
accttaagta tttccaggac caagtgagca 
atgaagcctg gaatggattc gtggctgctg 
tccgtaaagc tctgaacaag cttgcaagtc 
ataaagacca gcagcacagg cagtggtttt 
ttgaggatca cataaggaag ctccgtgccc 
gcaccaccat tgccaatgtg gtgtccaact 
tcctcggcct gggtctggca cccttcacag 
gcatgggtct gggagcagca gctgctgtgg 
taaacaaatt gcgggcacga gcccaagccc 
caaaggtgat gaaggagttt gtgggtggga 
attggtacca agtcacacaa gggattggga 
ccaaccctca gttaggagcg tatgccccac 
aaggcggtga acaggttgag agggttgttg 
ccatgatcgt gggtgcagcc actggaggca 
catatgagtc aaagcacttg cttgaggggg 
agcgggctca ggagctggag gggaagctca 
agccaggcca agaccaatga ccccagagca 
caggccagga caaaatgcag actttttttt 
cgctttttcc cccagtaggg gtggcggggc 
ggggggattc gattaacgc 



cccccacctg ctgtgccctt aaactgcaga 60 
tcagatgggt tcaaaaaact ccccaggctc 120 
cagctgacac agcagacctt gaatcctgag 180 
gaggacgtgt ctggttatta cacagatgca 240 
acagttggat cttgctcagt ctctgtcaga 300 
ttggtgtgag agtgagggta gaggaagctg 360 
ggacagtgac tggagagctc caaggaaagc 420 
acccagagag cagtatcttt attgaggatt 480 
gagagaatct gctacaactg ctgactgatg 540 
ctgaactgcc cagggatgag gcagatgagc 600 
acatggtcat gaaggacaaa aaccgccacg 660 
tgaaagagtt tcctcggttg aaaagggagc 720 
ttgcagagga ggttgagcag gtccacagag 780 
ctgttggcac tacctctggc atcctgaccc 840 
aaggaatcag ttttgtgctc ttggacactg 900 
ctgggattac ctgcagtgtg gtagaactag 960 
gcaacttgga ccaaagcggc accaatgtag 1020 
acacacccaa tgttcttacc ttagttgaca 1080 
ggaacatccg tgccatcaga cgagccagag 1140 
ccccgcatgt cattgggcga atctcagctg 1200 
aaggccccgc ccaggcaatg agcagaggaa 1260 
tcttgcttct gctggatgtg gtcagccttg 1320 
caaagtcaga gtcagctgag gagctgaaga 1380 
actttctcac caagatccat gagatgctgc 1440 
gtgcagccac cagggcagaa atgccgggca 1500 
ttttcaagtc tttgacgggg aagggagctc 1560 
ccaactctgg gccgtgtgaa cctcccgggg 1620 
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<210> 48 
<211> 1600 

<212> DNA I 
<213> Homo sapiens 

<220> 

<221> itiisc_feature 

<223> Incyte ID No: 1703342CB1 

ciaaacaacc caggacaggc aggggctgca cgcggtgaag aaaccaagac gcagagaggc 60 
cSalcStt Scttggltc acacagccaa aggaggcaga gccagaactc acaaccagat 120 
cSaaSScaa Sgggllltg gccacctggg acgaaaaggc agtcacccgc agggccaagg 180 ; 
?ggS??cgc SSSggatJ agcaagttct taaggcactt cacggtcgtg ggagacgact 240 | 
Satqcctg gSlatlaal tacaagaaat gggagaatga agaggaggag gaggaggagg 300 
agcagccac? Eccacacca gtctcaggcg aggaaggcag agctgcagcc cctgacgttg 360 

f^t&rtc s?M K 

arikc aSSfa =s 

tlcaSS rc2Sc- cSr.S^g ; g= ?|a| 
ttaaaSt?t SgSitgctg attotgctcc ggctgtggog ggtsgcccgg .tcatcaa^ 780 
SS?S?St StIiSttaai acacgttoag aacggcaact cttaaggtta aaaoagatga 840 
ggcScoSg attoLo.cS ttgagttcg etgctctgag aaggaacaag 900 
SSSSSg IS«L=aal ctattgcgac jacatggact tcttggtgaa Jtgaactag 960 

gSSftgl SSg SSSSS agSft^gl lt^==?=|c? X080 

ill s "11? rS 

bss s'tS t??r^«f ~s 
fa ssbicS ts^s ^rffl r=sss gii 
i™ ffi?s? effis i=s 

gtataaaata aactctctct tttccctgga aaaaaaaaaa 

<210> 49 
<211> 2380 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1727529CB1 

ctaaaccatg gggggaaagc agcgggacga ggatgacgag gcctacggga ^gccagtcaa 60 

i k b I it-zi^ s?s?rg?g «S| c» 

r~ « rasess isc^i ili i ? p i 

caacatcatc tcagttgctg agaacggcct acagtgcccc acaccccagg tgtgtgtgtc 3bu 
ctcctgcccg gaggacccat ggactgtggg aaaaaacgag ttctcacaga ctgttgggga 4^u 
agtcttc?at IcSaaaaca Igaacttttg tctgccaggg ^taccctgga ^tatgacggt 480 
rrai-caeaaac ctacaacagg aactctgccc cagtttcctc ctcccctctg ctccagccct dsu 
aaaacgSgc ?ttc5a?gga ccaacattac tccaccggcg ctcccaggga tcaccaatga 600 
?l?;a?ca?t cagcaggSga tcagcggtct tattgacagc ctcaatgccc g-g-atcag 660 
frrt-haMatc tttaaacratt ttgcccagtc ctggtattgg attcttgttg cccuggggg^ 
ggcStStc ttgScSac tgtttatctt gcttctgcgc ctggtggctg g^-^^^^ggt 780 
g?tggtgctg atlctgggag tgctgggcgt gctggcatac ggcatctact actgctggg 840 
ggagta?cgS gtgctgcggg acaagggcgc ctccatctcc cagctgggtt tcaccaccaa 900 
cctcagtgcc taccagagcg tgcaggagac ctggctggcc gccctgatcg tgttggcggt Sbu 
gcSSaicc atcctgctgc tggtgctcat cttcctgcgg cagcggattc jtattgccat 1020 
ogccctcctg aaggaggcca gcaaggctgt gggacagatg atgtctacca tgttctaccc iuhu 
a?tggtcacc tttjtcctcc tcctcatctg cattgcctac tgggccatga ctgctctgta 1140 



44749 



wo 01/46258 



PCT/USOO/35095 



cctggctaca tcggggcaac cccagtatgt 
ctgtgagaaa gtgccaataa atacatcatg 
gtgcccaggg ctgatgtgcg tcttccaggg 
tgtcttcaat ctgcaaatct atggggtcct 
ggccctgggc caatgcgtcc tcgctggagc 
gccccaggac atccctacct tccccttaat 
cactgggtca ttggcatttg gagccctcat 
cttggagtat attgaccaca agctcagagg 
gtgctgtttc aagtgctgcc tctggtgtct 
tgcatacatc atgatcgcca tctacgggaa 
catgctactc atgcgaaaca ttgtcagggt 
gctgttcttt gggaagctgc tggtggtcgg 
ctccggtcgc atcccggggc tgggtaaaga 
gctgcccatc atgacctcca tcctgggggc 
tttcggcatg tgtgtggaca cgctcttcct 
cggctccctg gaccggccct actacatgtc 
gaacgaggcg cccccggaca acaagaagag 
gactgcaccc cacccccacc gtccagccat 
tttgtggtaa aaaaaggttt taggccaggc 
tgagaggctg aggcgggcgg atcacctgag 
ggtgaaacct ccgtctctat taaaaataca 

<210> 50 

<211> 3038 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 2289333CB1 

<400> 50 

aggggcaggg aggcgggcac caggcgcggg 
ctgacgccgg ccacgcagcg gcgggagagt 
cgcgagagat ggaagcggcg gcgacgccgg 
tagatatgga tgtaatgagg cccttgataa 
aagaacatga gcaagagctt ctgcctgttc 
gcatttcatt tgtacaaact cttatgcacc 
taggacttcc attggcaata aaaaatgcag 
ttataggaat tatttctgtt cactgtatgc 
gtctgaggtt taaaaagtca acattaggtt 
tgagtccttg gagttgtctt cagaagcaag 
ttctggtgat aacacagctg ggattctgta 
tgaaacaagt tcatgaagga ttcctggaga 
catcaaaccc ttgtgagaga agaagtgttg 
catttataat tcttttggtc ttcattcgtg 
ttgccaacgt ttccatggct gtcagtcttg 
tgccagatcc ccacaacctt ccaatagtgg 
gtactgctgt atttgctttt gaaggcatag 
aagaatcaaa gcgtttccct caagcgttga 
atgtaacatt agctacttta ggatatatgt 
ctttaaatct tccccaagat gtatggttat 
gcatttttgt gacatattca attcagttct 
tcacatccaa atttcatact aaatggaagc 
tggttagtat tacttgtgcc ggagcaattc 
tcgttggagc tgtgagcagc agcacattgg 
ttacattttc gaaggaacat tataatatat 
tcactggagt tgttggcttc ttattaggta 
ctactcccaa agttgtagct ggcactccac 
gcttaacatc tggtttgaaa tagtaaaagc 
ttctgaaaat tatcaagata actagtaaaa 
aactctgttt tctttggcac gatattaata 
agtggtaaac ctatgaaaaa tccttgcttt 
ttctgaaaat tgaaaaaatt aaaatgtaaa 
tcttttatct cagtaagaaa taccttaacc 
ccactcactt gagaacagaa taggatttca 
aacaaeiaagc tctctccaga tcatccctgt 



gctctgggca tccaacatca gctcccccgg 1200 
caaccccacg gcccaccttg tgaactcctc 1260 
ctactcatcc aaaggcctaa tccaacgttc 1320 
ggggctcttc tggaccctta actgggtact 1380 
ctttgcctcc ttctactggg ccttccacaa 1440 
ctctgccttc atccgcacac tccgttacca 1500 
cctgaccctt gtgcagatag cccgggtcat 1560 
agtgcagaac cctgtagccc gctgcatcat 1620 
ggaaaaattt atcaagttcc taaaccgcaa 1680 
gaatttctgt gtctcagcca aaaatgcgtt 1740 
ggtcgtcctg gacaaagtca cagacctgct 1800 
aggcgtgggg gtcctgtcct tctttttttt 1860 
ctttaagagc ccccacctca actattactg 1920 
ctatgtcatc gccagcggct tcttcagcgt 1980 
ctgcttcctg gaagacctgg agcggaacaa 2040 
caagagcctt ctaaagattc tgggcaagaa 2100 
gaagaagtga cagctccggc cctgatccag 2160 
ccaacctcac ttcgccttac aggtctccat 2220 
gccgtggctc acgcctgtaa tccaacactt 2280 
tcaggagttc gagaccagcc tggccaacat 2340 
aaaattagcc 2380 



tcGctccggg caggcgaggt aggcctgggc 60 
gagcactcgg gcggcggcgt cctggagacc 120 
cggctgccgg ggcggcgagg cgcgaggagc 180 
atgagcagaa ttttgatggg acatcagatg 240 
agaagcatta ccaacttgat gatcaagagg 300 
ttcttaaagg aaatattgga actggccttt 360 
gcatagtgct tggaccaatc agccttgtgt 420 
acatattggt acgttgcagt cactttctat 480 
atagtgacac tgtgagcttt gctatggaag 540' 
cagcatgggg gcggagtgtg gttgactttt 600 
gtgtttatat tgtcttctta gctgaaaatg 660 
gtaaagtgtt tatttcaaat agtaccaatt 720 
acctaaggat atatatgctt tgctttcttc 780 
aactaaagaa tctatttgta ctttcattcc 840 
tgataattta ccagtatgtt gtcaggaaca 900 
ctggttggaa gaaataccca ctcttttttg .960 
gagtggtcct tccactggaa aaccaaatga 1020 
atattggcat ggggattgtt acaactttgt 1080 
gtttccatga tgaaatcaaa ggcagcataa 1140- 
atcaatcagt gaaaattcta tattcctttg 1200 
atgttccagc agagatcatt atccctggga 1260 
aaatctgtga atttgggata agatccttct 1320 
ttattcctcg tttagacatt gtgatttcct 1380 
ccctaatcct gccacctttg gttgaaattc 1440 
ggatggtcct gaaaaatatt tctatagcat 1500 
catatataac tgttgaagaa attatttatc 1560 
agagtccttt tctaaatttg aattcaacat 1620 
agaatcatga gtcttctatt tttgtcccat 1680 
tacattgcta tatacataaa aatggtaaca 1740 
ttttggaagt aatcataact ctttaccagt 1800 
taagtgttag caatagttca aaaaattaag 1860 
aaaattaaag aataaaaata cttctattat 1920 
aagatatctc tcttttatgc tactcttttg 1980 
acaataagag aataaaataa gaacatgtat 2040 
gaatgccaaa gtaaacttta tgtacagtgt 2100 
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aaaaaaaaaa aatctcagtt atgtttttat 
gtatagaaaa ctcccattaa cataatataa 
ttttacactc taatggtagt tgatcttcat 
tagtgtttca atgaaatttg acaagggact 
tttcgtcaga ggaaaaggag gcctagaaag 
ttgagatcaa gaaaacctaa tcttctgact 
atgtccaaga aaaagaataa attatgttca 
atattttaat actttgaaaa tgaatgtgtg 
aaatcaggga actccaagaa gcctacactg 
gctatgttct tttaaaacag aatagagacc 
tgtgtgtcta gctatactat ttgtggcttg 
agttttgtag gcaccacatt cctgaatggc 
ttgtggactc tttccagccc tgtggctttt 
gaataaaaga atcagctagg tgtggtggtc 
taagttggga ggatcacttg agaggccagg 
gacctcgact ctataaaaca taaaaaaaaa 

<210> 51 

<211> 2608 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 2720354CB1 

<4D0> 51 

taggctaatt ttttttacag acacgatttc 
tgacctcaag tgatccaccc acctcagcct 
actgcacctg gccaggctca tcactttttg 
ggacattagg gcaggggccc tt cage c tag 
tgacaagcat cacctggaga acgggccagt 
tgcactgtgc agccatagtg taaacaagag 
gaccccctac cctgtctcca gcagcctgtc 
cctgagaagg catagatacc cggcccaccc 
gcctgagtgc ccggcctcct tgggagaccc 
gctgggtgct ctggtcctgc ctgcctgcct 
ttcaggtcca gagcgtggag aagcctcagt 
tcatcaagca agagagcgtg ctgggcctgt 
tcaccttcat caacgcgctg gtgttcgggg 
acgactcgcc cctcaaccag ttcctggcag 
tctgctgccc catggagctg gccaagacgc 
gcacctacaa gggctcgctg gactgcctcg 
gcgtcaaccg gggcatggtg tccacgttgc 
tcctcaccta tgacgctctc acgcgggcgc 
tgcccaagct gctgttggcg ggcggtacgt 
ctgtggacgt ggtcaagtcg cggctgcagg 
gcggcatcct ggactgcgtg caccagagct 
gggggctggc gtccacgctg ctgcgcgcct 
tcacggtggt gctcacctac gcgcgcggcg 
ccgccgcccc tgcggggcct gccctggcgc 
ctccttcccc agggctcctt ctcagaaacc 
ccctgcttcc tgctgggatg ctgcgagctg 
ctgatcagct gggtagtttt ggccgagaac 
ataaggaccc tcatgcccac actgtagagt 
gccagcacct ggcctggctg aggccattgc 
ccagcccctt tctgggatcc tggccacgtc 
gggggtctct gatggccaac caaggggcca 
acccgggggg gtggtgggcc acccctctgg 
tgcctcctgg tgtcacagaa ctggafccctc 
ctaggggtac cccagctgct gccactcctg 
ccgggaagcc atggagtctg ctggaggcac 
caaacaggcc agcggtggag gtctggacag 
aatccttccg cctctgggcc tcaggctgcc 
cggacaactg agtctccggc ccacctacca 
agggctcagc agagaacagt atgggacccc 
aagaagccaa aggtcagtcc ctctgctccc 



tagccaaatt ctaatgattg gctcctggaa 2160 
gcatcagaaa attgcaaaca ctagaattaa 2220 
agtcaagagg cactgttcaa gatcatgact 2280 
ttaaaactta tccagtgcaa ctcccttgtt 2340 
gttaagtaac ttggtcgaga ccactcagcc 2400 
cccaggccag gatgttttat ttctcacatc 2460 
gc ttaatttt agtgttgaat ctatttgatt 252 0 
atttttaata gtatatgtga cctgagcaga 2580 
tggccatata aacctcagca agagaaagaa 2640 
gcttgctggt gaaactcctg gctagtaaga 2700 
agctttttta attattacct tcctttcctg 2760 
agaaaataga cacctcagaa aacggaggat 2820 
cttatcacag ccttttattt attatgagca 2880 i 
tgtgcttata atcccagcta ctctggagga 2940 | 
agcttgagac cagcctgggc agcatagtga 3 000 
aaaaaaaa 3038! 



gccacgttgg ccaggctggt cttgaactcc 60 
cccaaagtgt tgggattaca ggcgtgagcc 120 
cgcctattgc ctcgaagcca gtctctgatg 180 
tctgggacat gggccgctca ctcagcagta 240 
ctcaggaggt cgttcatgcc ccactggcag 300 
gcttaacctg aactggtctg agatcttggg 360 
cctttagctg tttgcctact ggcaccccat 420 
tgccctggaa ttacaaaagt cttagactgt 480 
tcctaggcag cctaagcacc agacccggga 540 
ctcactggac cctctccttc caggtacggc 600 
accgcgggac gttgcactgc ttcaagtcca 660 , 
acaagggcct gggctcgccg ctcatggggc 720 
tgcagggcaa caccctccgg gccctgggcc 780 
gtgcggcggc gggcgccatc cagtgcgtca 840 
ggctgcagct gcaggacgcg ggcccagcgc 900 
cgcagatcta cgggcacgag ggtctgcgtg 960 
tgcgtgagac tcccagcttc ggcgtctact 1020 
tgggctgcga gccgggcgac cgcctgctgg 1080 
caggcatcgt gtcctggctc tctacctatc 1140 
cggacggact gcggggcgcc ccgcgctacc 1200 
accgcgccga gggctggcgc gtcttcacac 1260 
tccccgtcaa cgctgccacc ttcgccaccg 1320 
agrgaggccgg gcccgagggc gaggctgtgc 1380 
agccctccag cctgtgacgc tcaccccgcc 1440 
tgggacataa attggcccct gagtcgattg 1500 
tggagtctat cagatgtggg ctgaattttg 1560 
tgcacttgcc tcagtgttct catctatgaa 1620 
cacgaagctc agagattatt cccagcagca 1680 
accgttatcc tggaaactga ggcagacact 1740 
attgtgctcc tgccctgcag gctggctccc 1800 
cccagggacc tctaactcca cacatcctcc 1860 
tctgtgttag ggacagagga aaacttggtg 1920 
tgcatacccc agcttctcca catgccactg 1980 
ctggagggtg aactggggac cc tgcaccct 2040 
catatcagcc tgcgggacta gggtggggag 2100 
ttcaagtgtg atgcagctgt ggcaaggaga 2160 
tgtccataaa atggggacat ggccagctga 2220 
ccgccagcca ggatccccca aagtgtgcag 2280 
ctcaccaggc ctggaacacc tccagccaca 2340 
cagcaaacgg tgcctcccag gcattctcag 2400 
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tgccagggct tcatccctgt gaaggcacag 
tggggcctgg ggcagaggag ggctgcacca 
gacactgggc tttgcacagc ctggtgtcgc 
tctttgtttt ttaaaaaaaa aaaaaaaa 

<210> 52 

<211> 3804 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 3038193CB1 



ggcctgctag tgggcacagg ggtggctagt 2460 
ggcgtcctgg ggaatgtgct cagtgaagac 2520 
tgtacagaaa ctgtcaaggg aataaagtgt 2580 

2608 



<400> 52 I 

ccctttcctg tcactggcta ctaccactcc caaccctcct caaagccgcc ggagcaaccc 60 
ccaggtcttt actttacaat cggcaatttg acttgctctg ctgcatgtct ggagggacca 120 
aggaaagtgt ggagacgctc caaggattag gtgatcggag cttgaaaaga aaaaaagcca 180 
aacaaataaa caaaacccac ccaccctaac aaatatgagg ctgctggaga gaatgaggaa 240 
agactggttc atggtcggaa tagtgctggc gatcgctgga gctaaactgg agccgtccat 300 
aggggtgaat gggggaccac tgaagccaga aataactgta tcctacattg ctgttgcaac 360 
aatattcttt aacagtggac tatcattgaa aacagaggag ctgaccagtg ctttggtgca 420 
tctaaaactg catcttttta ttcagatctt tactcttgca ttcttcccag caacaatatg 480 
gctttttctt cagcttttat caatcacacc catcaacgaa tggcttttaa aaggtttgca 540 
gacagtaggt tgcatgcctc cgcctgtgtc ttctgcagtg attttaacca aggcagttgg 600 
tggaaatgag ggcatcgtta taacacccct gctcctgctg ctttttcttg gttcatcttc 660 
ttctgtgcct ttcacatcta ttttttctca gctttttatg actgttgtgg ttcctctcat 720 
cattggacag attgtccgaa gatacatcaa ggattggctt gagagaaaga agcctccttt 780 
tggtgctatc agcagcagtg tactcctcat gatcatctac acaacattct gtgacacgtt 840 
ctctaaccca aatattgacc tggataaatt cagccttgtt ctcatactgt tcataatatt 900 
ttctatccag ctgagtttta tgcttttaac tttcatcttt tcaacaagga ataattcggg 960 
tttcacacca gcagacacag tggctatcat tttctgttct acacacaaat cccttacatt 1020 
gggaattccg atgctgaaga tcgtgtttgc aggctatgag catctctctt taatatctgt 1080 
acccttgctc atctaccacc cagctcagat ccttctggga agtgtgttgg tgccaacaat 1140 
caagtcttgg atggtatcaa ggcagaagaa actactccaa accagggggc cactggctaa 1200 
cttgaataat ccagaaggct tggaatatct atccatcaaa tttgggcatt aaaataaata 1260 
ccaagagtcc atcctccagg gagtgaagct gacaaggccg acagtataac aaaggaggtg 1320 
gactttctgt agcaatgtat atatgtacag gattgtacat actagcaatt ctgaagactt 1380 
gtacttgtga atgttgcctc aatgcatatt ttattttttt acacaaaaat atgagatcct 1440 
gtttaagtgc cttaaaatgt atttgacaag agcgttattt ccacaatatg ctttgttgat 150.0 
tactgccagg ggtggtacaa tatttggggg ttaattttgc tttcctaatg caggaatcag 1560 
tcatggtaag tgacaaaaag caaacatgct ttccctgcag cacctttgtg taatacaacc 1620 
ctatagtagt tactgtaatg tttgaaatga ggtcacacca tcaggaaaat gcccttctga 1680 
tgacagtgaa aatttccaaa gtcttattca tgcatacttt gatttactgt gtgattcttt 1740 
ttttctacga ctgtgacatg cctcttcctt atcaactcag caggggtcat agatcgaata 1800 
gatgctgaaa agcgtaagat atatgcattc cttgacatca tttttaaaga cattccttca 1860 
aatagtttcc acacagaaat tcctcactcc cattatgaga gattgtggtt atatgtctta 1920 
aatttattat aagctgcttc aaagaaaggg tctgaatgtt tgaattatga gtgaaatcat 1980 
gtgaaatttt gagttaaact ctgtgatttg attttcaggg tctttaaaat atatcttaat 2040 
atcttcttcc tctttattca ataatttctg tcttgcactt acacactcat aacagccaaa 2100 
tatgaggcac aaaaatgtta caatcagttt gaaagcagca tcaattaatg gtagattcta 2160 
ttcacattcc acaacccaga ccaaattttt ttcctattac gcagatgtgc tgagcacttt 2220 
ccagattgcc cctgttggcc aaaagcagcc tgttacatcc tggaattaag cacacttaag 2280 
gtatttgaga caatttatta atgaaaattt ccttggcaga tttgacaaat gttggcaata 2340 
tttttttaaa agttaaatca tattgctttc atgaataaat gaaaatataa aggtcatgga 2400 
tgcaaacaaa tgttacatat acacattctg tctctccaga tgaaaagaac atgcaaaacc 2460 
atttaataac caaaatatca agtaaaatta gttcccaacg gggcagcagc tttcaaatga 2520 
gtgtccaata tttgcttctg ctatagctgc aagaactgta actggaccca agtagagaat 2580 
gaagccacgt atagaactac gagaacactt ttctgtgttt cccccatgcc gtcctgtcac 2640 
atcctcttac acgtcctctc ttgatttgat agacaatatt ggcatcctgg gtctcactga 2700 
ggccgtgcta tgtcctcagc agctgttttt gttgtttcgt tattatgccc acaacaaaaa 2760 
atcattcctt agaaactcac caagtttatc tactgtgtaa atttatatta ttgttactac 2820 
caggtctcat cttttgtcaa tgtcattgaa taaatttcat aagagttatt ctcagtgtga 2880 
attttaaggc taatgccaga tcctgcaaaa atctatgcta accaggctgt agtacacact 2940 
gttataaaga attttacttg tgtctaaaac tacagtaatt ttgcttaggt aattgtgctt 3000 
acctatggag cacaggaagg ctcttaggtt ttgttcctac aagtttcttt gaattttgga 3060 
gtaaatggaa gtgtctgtct gtctgtcatc tatctgccct atcataaaaa tctttctccc 3120 
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taacattaaa atactgatcc 
gtaggtgtga tcatgggata 
ctttaaaaat gtgctttgtt 
gatatgttta tgcctatttc 
aacaaaaaat aaaatcaaaa 
agcttaaaat attttgttat 
tgtctaaatg tatttatgaa 
acctggattt tttttaatgg 
tattttcatg ttttttctta 
tctgttaaaa attaacacac 
tttcaaaagg attggttaag 
aattatatga ttaaacaaag 

<210> 53 
<211> 1894 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> miscfeature 
<223> Incyte ID No: 3460979CB1 

<220> 

<221> unsure 
<222> 1651 

<223> a, t, c, g, or other 

<400> 53 

acggatcact agtatgcggc gcagtgtgct ggaaagggaa caaacatggc cgctctggcg 60 
cccgtcggct cccccgcctc ccgcggtcct aggctggccg cgggcctccg gctgctccca 120 
atgctgggtt tgctgcagtt gctggccgag cctggcctgg gccgcgtcca tcacctggca 180 
ctcaaggatg atgtgaggca taaagttcat ctgaacacct ttggcttctt caaggatggg 240 
tacatggtgg tgaatgtcag tagcctctca ctgaatgagc ctgaagacaa ggatgtgact 300 
attggattta gcctagaccg tacaaagaat gatggctttt cttcttacct ggatgaagat 360 
gtgaattact gtattttaaa gaaacagtct gtctctgtca cccttttaat cctagacatc 420 
tccagaagtg aggtaagagt aaagtctcca ccagaagctg gtacccagtt accaaagatc 480 
atcttcagca gggatgagaa agtccttggt cagagccagg agcctaatgt taaccctgct 540 
tcagcaggca accagaccca gaagacacaa gatggtggaa agtctaaaag aagtacagtg 600 
gattcaaagg ccatgggaga gaaatccttt tctgttcata ataatggtgg ggcagtgtca 660. 
tttcagtttt tctttaacat cagcactgat gaccaagaag gcctttacag tctttatttt 720, 
cataaatgcc ttggaaaaga attgccaagt gacaagttta cattcagcct tgatattgag 780 
atcacagaga agaatcctga cagctacctc tcagcaggag aaattcctct ccccaaatta 840 
tacatctcaa tggccttttt cttctttctt tctgggacca tctggattca tatccttcga 9 00 
aaacgacgga atgatgtatt taaaatccac tggctgatgg cggcccttcc tttcaccaag 960 
tctctttcct tggtgttcca tgcaattgac taccactaca tctcctccca gggcttccct 1020 
atcgaaggct gggctgttgt gtactacata actcaccttt tgaaaggggc gctactcttc 1080 
atcaccattg cactcattgg cactggctgg gctttcatta agcacatcct ttctgataaa 1140 
gacaaaaaga tcttcatgat tgtcattcca ctccaggtcc tggcaaatgt agcctacatc 1200 
atcatagagt ccaccgagga gggcacgact gaatatggct tgtggaagga ctctctattt 1260 
ctggtcgacc tgttgtgttg tggtgccatc ctcttcccag tggtgtggtc aatcagacat 1320 
ttacaagaag catcagcaac agatggaaaa gctgctatta acttagcaaa gctgaaactt 1380 
ttcagacatt attacgtctt gattgtgtgt tacatatact tcactaggat cattgcattt 1440 
ctcctcaaac tcgctgttcc attccagtgg aagtggctct accagctcct ggatgaaacg 1500 
gccacactgg tcttctttgt tctaacgggg tataaattcc gtccggcttc agataacccc 1560 
tacctacaac tttctcagga agaagaagac ttggaaatgg agtccgtgta agaaatcttt 1620 
cttccctctt ccttagccct gaaccctttg nctaacacaa agcagcacag tgtgaatcga 1680 
gccggctggt ctcagcattt cgtggctgca ggggtgggtc ctctatattt agcagaaggg 1740 
accggcactg gagcccaagg ggtcggtctg gttgaaggca agatttggca accatactgg 1800 
gctgtgccgg aaaaggaaag ggggggccaa aaaacaattg gggccggcgt caaaaaaccg 1860 
ggcgaacaag agaaaaagcg ggcccaggag aaag 1894 

<210> 54 

<211> 1668 

<212> DNA 

<213> Homo sapiens 

<220> 
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ccgcccccaa cttatctacc 
aaattcaact gaaaatgcta 
ttcaaataat ctttacatag 
ttttttttac acaaattcct 
tttattttta attcatgctt 
gtttatacac tgtaaagcta 
agaaatacat tagattatat 
ttgttacaaa attagatttt 
gtattaaaat ttttgtgggt 
ctctagctaa tgttcagtgt 
tcataaagtg gattatttat 
aatg 



tctattgtct aacacctata 3180 
tgataacatt ttatcgtttg 3240 
tgaactttgg tggcgttagt 3300 
tggcatattt tttcataaag 33 60 
attgggattt aattattcag 3420 
tctgttttat gcatttgttt 3480 
ttatgtttac tcatttttcc 3540 
ttaatgggta ataatgttgg 3600 
tttttaaaat ttttccctat 3660 
ttgtgctaaa taccaaattt 3720 
gatgactgga agatgaaaat 3780 

3804 
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<221> mis cofeature 

<223> Incyte ID No: 74722 OOCBl 

<400> 54 

atgacactgg tttactttcc tccttcaaag 
agtcgcctgg cccaacagtt ggcccaatcc 
cggaccacta tccacggcct ggacaggctg 
ttcgtctggc tgtgcacctt tgtgagtgcc 
ctctccgccc gctacaacgc cgcccacttc 
gtttaccgca taccatttcc ggtcataacg 
cgcctggcgg aggcgaagtc aagattcctg 
ctcttcgagc tgattgtggg cacctacgac 
gagcgattgc gcaaccagcc aacggagctg 
gattttatga cctggcgctg caacgagctg 
tacgactgct gcgagatccg ctcgaagcgg 
aactcgctgg agacggaaga gggcaggcgg 
cgtactgggt cggcgggtcc catgagcgcc 
aagcactggc cggggcacag ggagacgaat 
gagccatttg tgtggcacaa caatccgttc 
gagatcgaac ccgtcatcta cttctatgac 
cgccagtgcg tcttcgatga tgagcacaac 
gtttacatga ttgaaaactg tcagtccgag 
aactgcacaa tggacctact gtttccaccg 
gagaaggagt tcgttcgcaa ccaatttcag 
tactccctca actacatcag cgatgtccgg 
aacaactcct atgtggacct ggatgtgcac 
accagcctcg tcttcggctg ggtggactta 
tttcttggct gctccctaat tagtggcatg 
ccggcctttg ggctggatgg actgcgtcga 
ggcgtaaccg tgcccacgcc cactttgaac 
gagaactaca ttatgcaact gaaggctgag 
aactggcacc gcataacatt tgctcaaaag 



cttcagcagc agcagcagcc atcgagatcc 60 
tcctggcagc tggccctgcg ctttggcaaa 120 
cttagtgcca aggccagtcg atgggagcga 180 
ttcctgggcg cggtgtacgt ttgcctgatt 240 
cagacggtgg tggatagcac gcggtttccg 3 00 
atctgcaacc ggaatcgcct caactggcaa 360 
gccaacggca gcaactccgc ccagcaggag 420 
gatgcttact tcggtcactt tcagtccttc 480 
ctcaactatg tcaatttcag ccaggtggtg 540 
ctcgcggaat gcctgtggcg ccaccatgcc 600 
cgcagcaaga acggcttgtg ctgggctttc 660 
atgcagctgc tcgatcccat gtggccctgg 720 
ctctccgtgc gtgttctcat ccagcccgcg 780 
gccatgaagg gcatcgatgt catggttacc 840 
ttcgtggccg cgaacacgga gacgaccatg 900 
aacgacaccc ggggagttcg ctccgaccag 960 
agcaaggatt tcaagtcgct gcaaggatac 1020 
tgccatcagg agtacttggt gcgctattgc 1080 
gacctgctca tctactccca caatcccggc 1140 
ggaatgtcct gcaagtgctt ccgcaactgc 1200 
cccgccttcc tgccaccgga tgtgtacgca 1260 
tttcgcttcg agaccattat ggtctatcgc 1320 
atggttagct ttggaggaat tgccggtctt 1380 
gaactggcct atttcctgtg cattgaggtg 1440 
aggtggaagg ctcgacggca gatggatctg 1500 
tttcaacaaa ccacgcccag tcagctgatg 1560 
aaggcgcaac agcagaaggc gaactttcaa 1620 
catgttattg gcaagtga 1668 
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